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2-to-26.5-GHz Synthesized Signal 
Generator Has Internally Leveled Pulse 
Modulation 

This second-generation instrument features 
microprocessor control , sophisticated sweep capabilities, 
programmability, and enhanced serviceability. 

by Wifliam W, Heinz and Paul A. Zander 



BROADBAND, synthesized microwave signal gener- 
ators offer the stability, frequency accuracy, and 
spectral purity of a synthesizer together with the 
level accuracy and AM and FM capabilities of a signal 
generator. They have found numerous applications in 
coram unications and radar testing and simulation. Pro- 
grammability has generated widespread use of these in- 
struments in automatic test systems. 

Since the introduction of the HP 8672 A Synthesized Sig- 
nal Generator in 1976, 12 " 3 increasing user sophistication 
and demands for performance enhancements have led to 
the next generation in this instrument family. The new 
8673A Synthesized Signal Generator (Fig. 1] covers the 
2-to-26.5-GHz frequency range and features internally 
leveled pulse modulation capability. The addition of mi- 



crocomputer control provides keyboard operation, data 
entry from the front panel, digital sweep capability, and 
many other features. Calibrated output levels from -100 
dBin to -r8 dBm are available over the 2-to-18-GHz fre- 
quency range, Maximum power is +4 dBm to 22 GHz and 
dBm to 26 GHz. 

[failure rates are of major concern to users of modem 
sophisticated instruments. The excellent reliability of the 
8672A has not been compromised in the design of the 
8673A. A number of service features and diagnostics have 
been incorporated to reduce repair time. 

System Operation 

The organization of the 8673A is similar to that of the 
8672A. The digital control unit (DCU) f containing a mi- 
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Fig. 1, The HP 8673A Synthe- 
sized Signal Generator features 
metered AM, low-distortion FM, 
and high-performance pulse 
modulation. Its frequency range is 
2 to 26.5 GHz, 



MAY 1983 HEWLETT-PACKARD JOURNAL 3 



)Copr. 1949-1998 Hewlett-Packard Co. 



1.95-6,625 GHz 
Input 



Power 
Amplifier |so , B|or YTM 




Input 



1. 95-26. 5GHz 
Buffer Logarithmic Output 

Amplifier 

Reference Voltage 



Logarithmic 
Amplifier 



Buffer 



coprocessor, directs the internal oscillators, switches, at- 
tenuators and digital-to-analog converters (DACs) to ap- 
propriate operating points to produce the frequency, level, 
and modulations requested by the user either via the front 
panel or remotely via the MP- IB (IEEE 488), The frequency 
synthesis section is similar to that of the 8672 A . This sec- 
tion includes a 2-to>G.625-GHz YIG-lu .ned * oscillator ( YTO) 
which is phase-locked to signals derived from a stable 
10-MHz reference oscillator. The RF oulput section con- 
tains a YIG-tuned multiplier (YTM) which either passes the 
amplified YTO signal or multiplies it by 2. 3, or 4 to cover 
the 2-to-26.5-GHz frequency range (see article, page 10). A 
block diagram of the RF output section is shown in Fig. 2, 
The signal from the YTO is amplified and passed through a 
pin diode modulator for amplitude modulation and level 
control (ALC). Pulse modulation is performed in the sub- 
sequent series/shunt diode modulator before power 
amplification and multiplication in the YTM. 

A number of benefits accrue from this configuration, in 
which pulse modulation is performed ahead of the multi- 
plier. Adding a modulator after the YTM would absorb 
valuable power, particularly at the higher frequencies. The 
design of such a component to 26 GHz would be difficult, 
especially if the desired pulse on' off ratio of 80 dB is to be 
maintained. Another advantage of modulation ahead of the 
multiplier is the virtual elimination of pulse video feed- 
through by the filtering action of I he Y1G filter in the YTM. 
A disadvantage of this system is the deterioration of pulse 
rise time through the YTM. The approach used in solving 
this problem is discussed in the article on page 10. 

The multiplier is driven by a broadband GaAs FET power 
amplifier which produces a typical output power in excess 
of +27 dBm. 4 After multiplication in the YTM, the signal 
passes through a broadband directional coupler that has a 
leveling detector at the coupled port. The detected dc volt- 
age is fed through a logarithmic amplifier and summed into 
the ALC loop. In pulse mode, the ALC loop error signal is 
generated by sampling during the on time of the pulse and 
holding between pulses. To achieve accurate leveling of 
pulses down to 100-ns pulse widths, the sampling gate 
window must include only the flat top of the sampled pulse, 
yet be as broad as possible to maximize effective duty cycle. 

*Ttae treq u en cy-a^errriinirig element is a spfiere of yttrium-inon -garnet in a magnetic Mid- 
i's resonant frefjuoftcy is tunerj by varying wie magneiic fiefd strengm. 



Fig. 2. Block diagram of the 
8673A RF output section showing 
the major microwave components 
and the automatic level control 
(ALC) circuitry 



Thus the rise time of the delected pulse must not be de- 
graded through the detector and logarithmic amplifier. 
This requires a low bypass capacitance (.3 pF| for the detec- 
tor and sufficient bandwidth for the logarithmic amplifier. 
The actual gain-bandwidth product exceeds 500 MHz. 

After the leveling coupler* the signal passes through a 
step attenuator which provides a maximum of 90 dB of 
attenuation in 10-dB steps, Continuous control of power 
levels between steps is provided by the ALC loop reference 
voltage which is adjustable by means of the vernier knob on 
tlii- front panel. In remote operation, a DAC provides the 
reference voltage in 0/1 -dB steps, 

Performance 

The instrument performance specifications are pub- 
lished only after characterization of a sufficient number of 
instruments to provide meaningful statistical data. The 
specification allows for measurement uncertainties (usu- 
ally calculated to achieve 95% confidence levels), tempera- 
ture and environmental variations, and any potential drift 
that may occur over time. For these reasons, measured 
room- temperature performance may be better than a 
specification might indicate. 

Maximum power output of a signal generator is of major 
concern to users interested in using the instrument as a 
local oscillator or where measurement system losses are 
high, Fig. 3 is a plot of maximum output power obtained 
from ten production 8673 A instruments together with 
specified power as a function of frequency. The graph 
shows the margin between the specifications and the power 
available at room temperature. 

The level accuracy of the instrument is important to those 
measuring receiver sensitivities or the transmission charac- 
teristics of amplifiers or other devices. Fig. 4 shows level 
accuracy plots for ten production instruments for several 
ranges down to -1 00 dBm, In view of the excellent results 
obtained, it was decided that microprocessor correction of 
output level was not needed, eliminating the disadvantages 
of requiring new ROMs when components after the YTM 
are replaced. 

Fig. 5 shows typical pulse performance at 26 GHz. 
Specified maximum rise time is 35 ns and maximum over- 
shoot is 20%. Leveling accuracy down to 100-ns pulse 
widths is ±1 dB relative lo CW. To achieve 80-dB on'off 
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ratio, careful shielding of components is required. Leakage 
or radiation from components ahead of the pulse modulator 
is kept low enough not to leak back and be amplified by the 
power amplifier's 25 dB of small-signal gain, 

Program friability 

The 867 3 A is fully programmable via the HP-IB from an 
external controller. Extensive design effort went into mak- 
ing the remote programming as user-friendly as possible. 

One HP-IB innovation is the master- slave sweep. This is 



useful in testing of receivers and mixers where it is neces- 
sary to have two signal generators sweeping with a fixed 
offset between their output frequencies. One S673A is des- 
ignated as the master unit and sends out HP-IB commands 
to one or more slave #673As, Master-slave sweep can be 
performed without a computer to control the system. The 
slave holds off the HP-IB handshake until its output has 
settled. The master looks for release of the handshake before 
proceeding to the next frequency. This ensures that the two 
synthesizers track each other. 
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Fig. 5. A typical 8673 A RFpuise at a earner frequency of 26 
GHz. The horizontal scale is 20 nsidiv. 

Of particular interest to system programmers trying to 
achieve maximum performance is the ready bit in the HP-IB 
status byte. The 8673 A may take different times to settle 
after a frequency change. By sensing when the 86 73 A has 
actually settled instead of always waiting for a fixed worst- 
case delay time, the test system can run faster, The ready bit 
indicates that the 8673A has phase-locked and finished the 
YTM autopeak routine (see page 12) at a new frequency. 
The ready bit can be configured by a user program to gener- 
ate a service request interrupt. Using this feature, the com- 
puter can be performing other useful work while the 8673 A 
is busy. 

Rear-panel output connectors provide sweep and blank- 
ing voltages for sweep displays on recorders, oscilloscopes, 
or network analyzers such as the HP 875 5C or the HP 
8410B/C Stop-sweep and trigger outputs required for oper- 
ation with network analyzers are available from a 14-pin 
connector on the rear panel of the 86 73 A , 

Other interfaces provided at this connector permit useful 
functions without the need for an HP-IB controller. These 
include remote frequency incrementing and decrementing, 
a trigger- sweep input, an end-sweep output, blanking of the 
frequency display, and sequential storage register recall. 

Digital Control Unit 

One of the design objectives of the 8673A was to replace 
the combinational logic control section of the 8672 A w f ith a 
microprocessor. By using circuit boards and software de- 
signed for the 86G2A Q\01-to- 1280-MHz Signal Generator, it 
was possible to make a first breadboard controller capable 
of keyboard entry of microwave frequencies in a matter of 
weeks* With a base of proven circuits as a starting point, the 
development effort concentrated on user friendliness and 
enhanced performance of the analog circuitry and service- 
ability to try to match the expected applications. 

One of the features of the 867 3 A is digital sweep. The 
digital control unit (DCU) can completely synthesize a 
s%veep from a series of discrete frequencies. The sw r eep 
range can be entered as either start-stop or center 



frequency- Af (span sweep]. The DCU has the necessary 
firmware to calculate either pair of values from the other. 
For example, if the user enters a start frequency of 10 GHz 
and a stop frequency of 5 GHz, the 8673 A will calculate a 
center frequency of 7,5 GHz and a Af of -5 GHz. 

One area in which the 86 73 A departs from conventional 
analog sweepers is in the control of sweep rate. The 8673 A 
is primarily a microwave synthesizer, so sweeps must be 
divided into discrete steps. The user can enter the number 
of steps or the step size. Obviously, the more steps to be 
generated, the slower a sweep will be. The other aspect of 
sw T eep rate is the time per step. In a synthesized signal 
generator, this time is the sum of the transition time be- 
tween frequencies and the dwell time on each frequency. 
The 8673 A can achieve a faster overall sweep rate by allow- 
ing the user to specify the dwell time. The DCU automati- 
cally checks the loops for phase lock after each step and 
controls the timing and Z-axis blanking accordingly. 

For automatic sweep with dwell times shorter than 5 
milliseconds, the DCU does not wait for a complete phase- 
lock of all four loops. A heuristic algorithm considers 
which phase-locked loops are required to change on a par- 
ticular step and estimates the required delay to be "close 
enough" that the frequency error w f ill not be significant on 
most swept displays. As a result, the overall sweep time can 
be reduced. This makes the 8673A useful for such applica- 
tions as aligning a circuit while watching a real-time swept 
display. At the same time, the synthesized nature of the 
8 67 3 A eliminates the effects of drift commonly associated 
with analog sweepers on narrow sweeps. If the application 
requires a complete phase lock at each step, the user can 
simply enter a dwell time of at least 5 ms and the 867 3A will 
give a complete phase lock. 

The microprocessor-based controller of the 8673 A is used 
in several ways to improve performance over that of the 
earlier 8672 A. The most striking of these is the autopeak 
function. The microprocessor adjusts the tuning of the 
Y1G- tuned multiplier to track the output frequency for 
maximum pow r er and best modulation performance. This is 
described in detail in the box on page 12. 

A number of internal features add measurably to overall 
ease of use and performance. For example, the 8673 A has a 
number of functions that need timing, Instead of program- 
ming the microprocessor to go into a delay loop counting 
down numbers until the necessary time has elapsed, which 
would restrict the 867 3 A to performing one function at a 
time, an LSI multiple- timer IC and the basics of real-time , 
multitasking programming are built in. This allows the 
8673A to do several things simultaneously, 

Another internal feature is the hardware divider circuit 
Whenever the 8673A output frequency is above 6.6 GHz, 
the frequency of the YIG-tuned oscillator must be multi- 
plied. From the perspective of the DCU, the desired fre- 
quency must be divided to calculate the Y TO frequency. For 
output frequencies from 6.6 to 12.3 GHz, the YIG-tuned 
multiplier multiplies by 2, so the DCU needs to divide by 2 + 
Division by 2 is fairly easy. The binary number is simply 
shifted one place to the right. For output frequencies above 
1 8.6 GHz, the YTM multiplies by 4 t so the DCU must divide 
by 4. Division by 4 can be done by two divisions by 2, 
However, from 12.3 to 18,6 GHz, the YTM multiplies by 3 
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Sample-and-Hold Leveling System 
by Ronald K, Larson 



Qes gn goafs for the 8673A's automatic level ctr . -_I 
system were as follows 

■ Broadband power leveling from 2.0-26.5 GHz in pulse, CW. 
AM, and FM modes 

Good AM performance (1 Hz to 200 kHz, to 90% depth 
a stable leveling loop 
^ Temperature- stable output power from - 1 to + 1 dBm 
External leveling capability using a power meter or a diode 
detector 
Pulse duty cycle as low as 0,0001 

■ M rnimum power trans i en ts when changing frequency or power 
Full HP- IB control. 

The leveling loop is shown in Fig. 2 on page 4. 

One of the most difficult problems to solve in a very broadband 
leveling system is caused by the variation in gain with frequency 
of the components in the microwave signal path. In a conventional 
or linear leveling loop the voltage-gain variations directly produce 
loop-gatn variations. This can cause loop stability problems and 
degrade AM performance. Also, it is usually desirable for a mi- 
crowave signal generator to control and meter output power in 
dBm rather than volts. With a linear loop, the system reference 
voltage must be a nonlinear function of power out in dBm, and 
therefore cannot be used directly to control a meter calibrated in 
these units. 

The use of a logarithmic amplifier in the feedback path of the 
loop has a valuable effect— the microwave gain of each part of the 
signal path in the leveling loop does not affect loop gain. Instead, 
the gain factor for each microwave component operating linearly 
is 1 dB/dB- The YTM is a nonlinear voltage-gain device but 
exhibits a nearly constant gain in dB/dB, when property biased, 
for each multiplying band and over the full range of power out put. 
Typical gain factors For the YTM are 1 .0, 1 ,4, 1 .©, and 1 .8 dB/dB on 
bands 1 , 2, 3, and 4. respectively. A factory-set loop-gam adjust- 
ment in the leveling system for each multiplying band compen- 
sates for the YTM gain changes. 

A detector operating in its square-law region will produce a 
large voltage-gain variation as power output varies. This would 
produce another source of loop-gam variation in a linear loop. In 
the logarithmic loop the gain factor is simply 2 dB/dB, greatly 
reducing loop-gain variations. 

Thus the log amplifier reduces total loop-gain variations to a few 
dB over the entire range of power and frequency. Loop gain is 
also independent of variations in small-signal gam that can occur 
in the microwave amplifiers, The result is a loop with nearly con- 
stant AM bandwidth and excellent stability, 

When the log amplifier is used," its voltage output varies linearly 
with power output in d8 Thus the reference voltage into the 
summing junction is linear with RF power out in dBm. This 
simplifies the control and metering of output power since the 
reference voltage need not be shaped and can be used directly to 
control the deflection of a meter calibrated in linear dBm units. 

The exponential following the integrator lets the integrator 
voltage control the modulator output power linearly in dB. The 



exponentiator gam factor is 0.9 decade of current per vol! input 
The pin diode modulator has the property that any decade o* input 
current produces the same dB change in output power. The 
expone r ' :rf36d3 voltat 

the in teg rate r o utpui 

To produce amplitude modulation in the loop, the modulation 
voltage is summed into the summing junction after being 
loganthmicalty shaped Shaping is necessary to produce RF en- 
velope voltage variations That are linear with the AM input voltage 
The resultant AM typically has less than 5% distortfon at 90% 
depth and 100 kHz rates. YTM linearity in dB/dB is a definite factor 
in achieving this kind of AM performance. The YTM using self-bias 
rather than fixed bias can mamtam bias stability, freedom from 
parametric phenomena, and a nearly constant gam factor over its 
full dynamic range. The actual dynamic range needed for power 
output from - 1 to +1 dBm in CW mode with 90% AM depth is 
40 dB. Microwave amplifier compression can add another 10 to 
15 dB. Therefore, the total ALC loop dynamic range is 50 to 55 dB. 

The voltage output of the detector varies with temperature. The 
temperature coefficient (TC) varies with power level- To correct for 
this varying TC, the logging amplifier has a thermistor in a resistor 
network to correct for detector drift at -4 dBm power output. This 
leaves a res idual drift term at all other power levels wh ich m ust be 
corrected. This term is proportional to power level in dB. It is 
corrected by using a linear-TC resistor in the reference voltage 
circuit. The result is typically less than 0.1 dB of drift over the 
specified temperature range of 15 to 35*C and over the full range 
of power and frequency. 

Operation of the leveling loop in the pulse mode is identical to 
the CW mode since the sampling switch is closed only when RF is 
on. When the switch is open the integrator capacitor holds its 
charge, thus maintaining constant output voltage, The current into 
the pin modulator is constant when the RF is off. When RF again 
turns on. the switch closes and any charge that may have leaked 
off the capacitor is replaced. This design— where the loop actu- 
ally samples the error voltage — eliminates the requirement seen 
in some peak-leveling systems to slew the hold capacitor voltage 
to the full value of the pulse on each RF pulse. The integrator 
capacitor also provides The system's dominant pole 

The ALC loop has two selectable bandwrdths. The wide 
bandwidth allows high rates of amplitude modulation on a carrier 
and fast transient response. The narrow bandwidth is used for CW 
signals and reduces AM noise on the carrier, but has slow transient 
response. The 3673A's digital control unit automatically selects 
the wide bandwidth whenever the frequency is switched and then 
switches to the narrower bandwidth when appropriate. This al- 
lows the ALC circuit to recover more rapidly after a frequency 
change. Switching the ALC bandwidth as a function of frequency 
switching, modulation, and sweep mode would be impractical 
without a microprocessor 



MAY 1903 HEWLETT-PACKARD JOURNAi 7 



)Copr. 1949-1998 Hewlett-Packard Co. 



and so the DCU must divide by 3, A general-purpose divi- 
sion routine for the microprocessor would take more than 
20 milliseconds for this division. This is more than the 
specified worst-case frequency switching time! It would be 
unacceptable to require that much DCU processing time for 
certain output frequencies, 

The design used for division by 3 in the 8673A includes a 
special circuit, Fig. 6, to speed up this process. The micro- 
processor starts the division cycle by clearing the latch of 
any possible remainder from a previous calculation. Then it 
gets the first digit of the frequency from memory and writes 
it to the latch, For example, for a frequency of 12,345 T 678 
kHz, this is a 1. the latch stores the 1, and in turn drives the 
ROM with 01. On the next instruction, the microprocessor 
reads the ROM output through the data buffer. In our exam- 
ple, it will read a partial quotient of and a remainder of 1. 
When the microprocessor writes the next digit (2 in the 
example) to the latch, the remainder from the previous digit 
is automatically put into the latch at the same time. A 
dividend of 2 and a remainder of 1 from the previous digit 
combine to make 12. This will divide by 3 to give 4 with 
remainder. By freeing the microprocessor from manipulat- 
ing the remainder between digits and from calculating the 
absolute table address for each digit* the division routine for 
an eight-digit number can be accomplished in less than 0,2 
millisecond, This is 100 times faster than the general- 
purpose software routine, and almost as fast as division 
by 2. 

Electromagnetic Compatibility 

One of the classical problems in digital system design is 
that digital circuits tend to generate electromagnetic inter- 
ference, Preventing the signals generated in the 8673A's 
digital control unit from coupling to other circuits required 
several measures. The first and primary measure was to 
design the hardware and firmware so that the microproces- 
sor spends most of the time in a wait- for- interrupt state. 
When this happens, almost all of the logic lines in the DCU 
are quiet. As a second precaution, the buses carrying con- 
trol signals to the front panel, phase-locked loops, and 
output section are all driven by latches. The latch outputs 
only change when necessary. This further confines the gen- 
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Fig. 6. This circuit reduces the time it takes to divide by 3 by a 
factor of five over a software method. 



eration of noise to the DCU, These precautions plus good 
engineering practice with grounds and bypass capacitors 
reduce the coupling and radiation of digitally generated 
signals to a level well below the specified limits, The 8873 A 
passed the radiated-electromagnetic-interference test the 
first time a prototype was put in the screen room. Just the 
same, extensive type testing was performed to verify that 
nothing had been overlooked. 

Reliability 

With the increasing complexity of today's instruments 
reliability becomes a major concern. Considerable effort 
was devoted to thermal design, component ratings, vendor 
history and stress analysis during the design of the 8673A, 

Statistics acquired over the last five years since the intro- 
duction of the 8672A indicate a warranty failure rate well 
below the original goal. Actual warranty rate is about 32% 
per year, or an MTBF of 6700 hours assuming an operating 
time of 2000 hours per year for the instrument's 3100 com- 
ponents. 

The 8673A design followed the successful approach of 
the 86 7 2 A. Thermal profiles of the instrument were done to 
measure local temperature rises and to check that average 
rise was less than 10°C above ambient. The increased power 
dissipation of the 8673A led to a more massive heat sinkand 
supplemental individual heat sinks for the series-pass 
power supply transistors, A stress analysis computer pro- 
gram was used to evaluate each component in its operating 
environment to verify that internally generated derating 
guidelines were not exceeded. Careful failure analyses were 
performed on failed components to gain an understanding 
of the failure mechanisms and to provide vendors with 
appropriate information to rectify the problem, The failure 
rate analysis program, from which the 8672A failure rate 
has been accurately computed, predicts an 8 67 3 A MTBF 
greater than 5400 hours for 3320 components. 

Serviceability 

A number of features allow fast and easy troubleshooting 
if a failure does occur* First, every time the power is turned 
on, the DCU does a self-check of RAM and ROM. If a failure 
is detected, a code indicating the suspect IC is displayed on 
the front panel. During operation, if the DCU detects an 
abnormal condition, it displays a message on the front 
panel, For example, an output power unleveled condition 
causes the ALC UNLEVELED annunciator to be lit. This could 
be an indication of a malfunction, or simply that the user is 
trying to get more than the specified power at that particular 
frequency. Less likely failures are indicated by message 
numbers that are explained on the pull-out card. The DCU 
always attempts to continue operation. Despite a problem, 
the 8673A may still be useful for the particular measure- 
ment and service can be scheduled for a more convenient 
time. 

A special function key accessible when the top cover is 
open (or via the rear- panel programming connectors) makes 
it possible to use the controller section to simplify servicing 
the other portions of the instrument, For example, in the 
8672A it is necessary to connect a low-frequency function 
generator to the YTM drive circuit as part of the procedure 
to measure the passband and align the circuit. The 8673A 
can simply sweep the YTM fine-tuning DAC to generate the 
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ramp. The push of a couple of buttons eliminates the need 
for a piece of test equipment. 

The HP 11726A Service Support Kit includes some spe- 
cial active extender boards. The primary purpose of these 
boards is to allow the service technician to tell quickly 
whether a problem is in Lhe controller or elsewhere. For 
example, if the front- panel frequency entry and display 
appear to function but the output frequency is wrong, the 
problem could be in the controller or in a phase* locked 
loop. The numeric display on the extender board shows the 
DCU frequency output in decimal numbers. This makes 
it quick and easy to verify that the DCU is functioning 
properly. 

If the digital circuitry is malfunctioning, a number of tests 
facilitate component-level troubleshooting. A key part of 
these tests Is a 2K-byte ROM which is used only for trouble- 
shooting purposes. Normally, its data outputs are not con- 
nected to the rest of the controller circuitry. The debug 
ROM contains a test program that tests not only the other 
ROMs but also itself, if the debug ROM does not pass the 
test, the next step is to put the microprocessor into a free- 
run mode and use signature analysis to verify the address 
decoders and the debug ROM data, if this doesn't find the 
problem, it is time to check the clock waveforms and the 
power supply voltages. 

A dozen other routines are included in the debug ROM, 
Each routine is designed to test a specific part of the 8673A. 
Many of these tests also use signature analysis. For exam- 
ple, one routine exercises all of the digital logic in the 
output section. Each of the control latches is cycled through 
all valid data settings and each of the three DACs is stepped 
through a % r oltage ramp. If the DAC output waveform is not a 
si mple staircase ramp, digital signatures can be taken at the 
DAC input with the same test setup. 
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A Wideband YIG-Tuned Multiplier and 
Pulsed Signal Generation System 



by Ronald K. Larson and Lawrence A. Stark 



■^■HE KEY TO THE 2.0-tch26.5-GHz frequency range 
available at the output of the 8673A Synthesized 
Signal Generator (see article on page 3] is a broad- 
band YIG-tuned multiplier (YTM). which is shown 
schematically in Fig. ] . YIG stands for yttrium- iron-garnet, 
a ferrite material. When a YIG sphere is placed in a magnetic 
field, it exhibits a sharp resonance at a frequency that is a 
function of the magnetic field strength, 

The operating frequency range of the YTM is divided into 
four bands, which correspond to frequency multiplication 
ratios of 1, 2,3, and 4. In band 1 , step recovery diode D2 in 
Fig. 1 is forward -biased to a low impedance and no signifi- 
cant harmonic generation occurs. The four bands and the 
corresponding frequency ranges are listed below. 



Band 
Number 


Output Frequency 
Range (GHz) 


Input Frequeru 
Range (GHz) 


1 
2 
3 
4 


2,0 - 6.6 

6,6 - 12,3 

12.3 - 18.6 

18.6 - 26.5 


2.0- 6.6 

3.3 - 6.15 

4,1 - 6.2 

4.65 - 6.625 



The YTxVl consists of a standard step recovery diode mul- 
tiplier which generates a comb of harmonics of the input 
frequency. The input frequency is tunable over a broad 
range and the multiplication ratio is varied by tuning a YIG 
filter to select a single harmonic component. The multiplier 
is inherently broadband in that the comb spectrum gener- 
ated by the diode extends from the input frequency to an 
upper limit greater than 30 GHz, By tuning the YIG filter a! 



Buffered Step Recovery 
Diode Bias Voltage 
-+ ► 




YIG Filter 



-4.0V -0.55V 



YTM Injected 

Fufse 



Ft g. 1 . YIG -tuned multiplier sch ematic , A t ouip ut po wer levels 
greater than approximately dBm, the steady-state value of 
the step recovery diode bias voltage V(t) is directly propor- 
tional to the RF input voltage. The FET resistance is controlled 
to give the highest conversion efficiency consistent with stable 
parametric-free RF output. 



the output of the multiplier to one particular harmonic, all 
unwanted output signals are suppressed and the desired 
frequency is delivered to the output of the device, The input 
low-pass filter prevents the output signals from returning to 
the input and prevents harmonics of the power amplifier 
feeding the YTM from interfering with the multiplied 
signal. 

In the multiplying bands, the step recovery diode is 
biased to act as a charge con t roll ed switch whi ch prod uces a 
narrow voltage impulse when the diode switches from for- 
ward to reverse bias. The impulse width is determined by 
the circuit inductance and the diode capacitance, assuming 
that the transition time of the diode is negligible. Impulse 
widths of 40 picoseconds are necessary to obtain high con- 
version efficiency at 26 GHz. and since the diode transition 
time should be a small fraction of the pulse width, it is very 
important to obtain diodes with low capacitance and short 
transition times. 

The proper timing of the switching action i s controlled by 
the dc self-bias voltage. The ideal timing point for switch- 
ing to low capacitance occurs when the diode current has 
reached its maximum value. The resulting diode voltage 
impulse goes negative initially and then positive, at which 
point the diode switches back to the conducting state. 

The input loop inductance of the YIG filter and the diode 
capacitance form the resonant circuit for the impulse 
generator, The input low-pass filter provides an impedance 
match at the input frequency and a short circuit to har- 
monics. 

Mechanical Design 

The complete multiplier is constructed on a single sap- 
phire substrate. The design goal is to provide the closest 
possible physical proximity between the step recovery 
diode and the YIG output filter. This increases the broad- 
band capabilities of the circuit by reducing the path length 
of the unwanted harmonics that are reflected from the YIG 
filter back to the diode. 

Fig, 2 shows a photograph of the region around the YIG 
filter and step recovery diode, The diode is die-attached to 
the top surface of the chip capacitor which is the final 
element of the input low-pass filter. The capacitor is 
epoxied to the heat sink, which provides an extension of the 
ground plane in the region of the rectangular hole, This 
opening is laser-cut. while the circular hole that holds the 
0.023-inch-diameter YIG sphere is drilled ultrasonically. 

The assembled substrate is held in a magnetic package 
consisting of a center body and two magnets. The design of 
this package provides differential expansion of the magnets 
and the center body so that the gap between the magnet pole 
faces remains constant w r hen the temperature of the struc- 
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Fig. 2. (a) Photograph of the YTM circuit showing the compo- 
nents and filter, (b) Photograph of the microctrcuit showing the 
ground plane, YIG sphere t and output coupling loop 

hire changes. This keeps the YIG filter tuned precisely to 
the output frequency, 

Input Low-Pass Filter 

Optimizing the input low-pass filter for a resonance-free 
stopband up to 26.5 GHz was a major effort. The transverse 
resonances in the distributed transmission line filter ele- 
ments forced a modification of the basic design, which was 
derived from standard filter element values. It was evident 
that the final capacitive element in the low-pass filter could 
not be realized on the 0< 01 -inch-thick sapphire substrate, A 
thinner dielectric was needed to push the transverse reso- 
nances past the upper frequency limit of 26.5 GHz. A 0.004- 
inch-thick single-dielectric capacitor with a nominal value 
of 2.0 pp was chosen empirically on the basis of perfor- 
mance in actual circuit tests. This was significantly lower 
than the prototype value of 2.7 pF, 

Step Recovery Diode Characteristics 

Two different step recovery diodes [SRD] have been used 
successfully in these YTMs, Both were developed for in- 
house use at Hewlett-Packard. One diode is an outgrowth of 
HP f s standard SRD product line and the other was de- 
veloped by HP's Santa Rosa Technology Center as a second 
source. A major effort in the development of this device was 
to keep the doping profiles as abrupt as possible. Coupled 



with intrinsic- layer thicknesses less than 1 ptm + this was 
seen as the key to obtaining the shortest possible transition 
time. Two other diode parameters of importance are the 
reverse bias capacitance and the recombination time. As the 
diodes are made physically smaller, the recombination time 
begins to drop because of the influence of the sides and 
contacts of the device. Many different diodes were 
evaluated to select the optimum junction capacitance for 
maximizing the conversion efficiency at 26 GHz. The final 
diode dimensions are a compromise among aspect ratio, 
capacitance, breakdown voltage, and transition time. 

Bias Control 

Optimum RF multiplication requires that the appropriate 
dc conditions be established for the diode. This is done 
through a separate bias circuit. A blocking capacitor pre- 
vents the dc from flowing in the microwave input circuit. 
Fig. 3 shows how the SRD's dc operating point is dependent 
on the microwave signal amplitude. The circuit that 
supplies these dc conditions is called a self-bias circuit 
because the dc operating point is established by the mi- 
crowave signal input amplitude. The advantage of the self- 
bias circuit is that a stable operating point is obtained easily 
over the full dynamic range of the multiplier because the dc 
conditions follow the RF power level smoothly. The correct 
operating point is determined by the bias resistance. If the 
bias resistance is set lower than the optimum value, the 
multiplication will be stable, but the power output will be 
low. As the bias resistance is raised, an optimum point is 
found where the power is high and the operation is still 
stable. At higher resistance, either the output power drops, 
or the operation becomes unstable. 

It has been found empirically that the optimum self-bias 
resistance is a function of frequency within a single multi- 
plying band. To provide for this variation, a single [FET 
chip is used as a voltage- control led resistor on the rnulti- 
pl ier substrate. The resistance Is controlled by adjusting the 
gate voltage. 

Another way of establishing the proper dc bias is with a 
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Fig. 3. Self-bias circuit operation 
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Autopeaking 

by Paul A. Zander 



One aspect of the YTM's performance is thai it acts as a very 
narrow bandpass filter that tracks the source output frequency, If 
the tuning is off by as little as 10 MHz, maximum available power is 
reduced, the pulse modulation waveform is distorted, and the 
frequency modulation sidebands are filtered asymmetrically. 

Avoiding these problems requires tuning accuracy better than 
0.1%, Because of nonlinearities, hysteresis, and temperature 
sensitivities in the YTM, this kind of tuning accuracy is impractical 
to achieve in a straightforward open-loop tuning system. Au- 
topeak is a combination of a small amount of hardware and 1 500 
bytes of microprocessor code that adjust the YTM tuning to the 
exact center of the passband {see Fig. 4 on page 13). 

Because the peaking process introduces some perturbations 
on the output which could affect certain measurements, peaking 
is never allowed to occur spontaneously. Peaking can be pre- 
vented by a front-panel pushbutton or an HP-IB command. 

Peaking is performed in conjunction with a number of operator 
inputs. It occurs whenever peaking is switched on. when the RF 
output is switched on or the FM deviation range is changed, and 
after every frequency change that results in an output frequency 
more than 50 MHz away from the last one where peaking was 
performed. Because peaking is critical to achieving good pulse 
performance, peaking is automatically enabled and performed 
each time the pulse modulation function is turned on. The time 
required for peaking depends on the YTM passband shape and 
the amount of correction required. To maximize the measurement 
rate in automatic systems, a bit in the HP-IB status byte fs set to 
indicate that peaking has been completed. This can be used by 
the HP-IB system as an interrupt to initiate the next part of an 
measurement cycie, thus preventing the taking of false data dur- 
ing the peaking routine. 

The Peaking Algorithm 

The peaking process consists of four phases: setup, tuning the 
YTM. measuring me YTM self-bias (for pulse injection), and restor- 
ing normal operation of the S673A. During the setup phase, the 
digital control unit (DC U) suspends normal operation of the output 
section. All modulation is switched off. The internal diode ALC 
detector is selected. If a very large frequency step has just oc- 
curred, some delay is used to allow the YTM frequency to settle 
completely before using the peaking as a fine tuning adjustment. 
Then the ALC circuit is put into hold mode using circuits already 
included forthe sampJe-and-hold operation during pulse modula- 
tion, Control flags are set in software so that the 8673A can accept 
front- panel and remote commands to turn on modulation, but not 
execute the commands until peaking has been completed. 

In the peaking process, the DCU uses an eight- bit digital-to- 
analog converter to tune the YTM ±200 MHz in steps of 1 .6 MHz. 
The peak sensing circuit, Fig. 1, is used by the DCU to direct the 
search for the center of the passband. Buffer ampler A1 
amplifies the output of the detector logarithmic amplifier in the 
ALC circuit. Because of the ALC log amp, a change in the YTM 
output of 1 dB causes a change of 300 mV at the output of At 
regardless of the absolute power A2 r CI. and S1 act as a 
sample-and-hold circurt which is controlled by the DCU. A3 com- 
pares the output of the sampie-and-hold circuit, which represents 
a previous YTM tuning setting, with me present output of A1 and 
sends the result to the DCU. The entire circuit consists of a quad 
FET operational amplifier IC and a few discrete parts. 



An offset corresponding lo 3 dB can be switched in before A3. 
Without the offset, the output of A3 will change when the present 
YTM output is less than when the sample was taken. With the 
offset, the comparator output will not change until the YTM output 
is 3 dB less than the sample. The timing of the sampJe-and-hold 
operation, and whether the offset is on or off. depends upon which 
of two algorithms the DCU Is using. 

in the coarse tuning algorithm, the offset is not used, The DCU 
closes S1 long enough to charge C1 and then opens S1 . Thus a 
voltage proportional to the RF power is held on Ct . Next the YTM is 
tuned 10 MHz. After a delay o\ 200 microseconds to allow the 
tuning coil time to respond, the DCU checks the output of A3. If the 
microwave output power is greater at the new tuning setting, the 
output of A3 will be high, otherwise rt will be low. If the power is 
higher, the DCU repeats this process of sampling the output 
powe r on C1 , tuning the YTM to a new setting and checking to see 
if the YTM output is greater or less. With suitable refinements to 
ignore minor resonances in the YTM passband shape r this al- 
gorithm does a good job of locating the main peak of the 
passband. However, it does not do a repeatable job of finding the 
exact peak. Instead, it tends to stop at a point a fraction of one dB 
past the true center of the YTM passband. For achieving 
maximum power output, it is adequate and represents a major 
improvement over the oider HP 8672A, which has no automatic 
peaking. However, the pulse and frequency modulated signals 
show some distortion with even a small amount of mistuning, so a 
second algorithm is built into the DCU. 
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Fig. 1. Block diagram of the YTM peak sensing circuit. 



12 HEW LETT- PACKARD JOURNAL MAY 1983 



)Copr. 1949-1998 Hewlett-Packard Co. 



Hie oasc problem with the coarse tuning algonthm is mat the 
power output changes very slowly as the YTM tunes 
muddle of the passband Rndmg the exact middle becomes an 
e of the measurement problem a* accurately 

8 solu- 
tion was to develop a centering algorir 

The centering s based on the 3t the YTM 

passband al :-: sonably symmetrical in the reg ion from 3 to 

jelow the peak. The DCU performs the centering algorithm 
by first assuming that the initial tuning potnt is dose to the peak. It 
samples and holds a voltage on C1 which corresponds to the 
rower output Then it switches in the offset voitage corre- 
sponding to 3 dB and starts stepping the YTM until the com- 
parator output changes This point corresponds to an output 
power level 3 dB Jess than the original point Next the DCU returns 
to :he original point and starts stepping the YTM tuning in the other 
direction until the other 3-dB point has been located. The center ol 
the passband is calculated to be halfway between the 3-dB 
points. Because the YTM passband is symmetrical, it makes no 
difference whether the original point was at the exact center at 
slightly below the ce 

Normally, the open-loop tuning is accurate enough that the 
centering algorithm alone is adequate. For small frequency steps. 
the time to search is further reduced by starting at the tuning 
correction for the previous frequency. Only when the centering 
algorithm encounters a difficulty, such as when the YTM tuning 
has been grossly pulled by a reactive load impedance, is the 



coarse tuning algorithm used. 

Bias Sample 

Once the YTM frequency has been tuned, the self-bias must be 
measured for use with pulse injection. The :na m pulse 

mode. The DAC output that controls the pulse inject ion is com- 
pared with the actual YTM bias voitage for CW output at that 
awave frequency. The DCU performs a 
straightforward successive approximation aigc ?asure 

the bias voltage For a fixed frequency and output power ieve?, 
that would be the end of it. However, the injection must change as 
the output power level is changed. \i only the power is changed, 
the DCU skips the YTM tuning procedures and simply measures 
the bias for the new power level. The DCU eventually builds up a 
table in memory of bias versus power level every 0.4 dB at that 
frequency. Once the needed data is in the table, the DCU simply 
looks it up. thereby avoiding the possible user inconvenience of 
having the DCU continuously switching to CW so that the bias 
injection can be measured, Of course, when the YTM tuning is 
changed (for example, when the frequency is changed) the table 
of bias values is erased, and the data must be remeasured. 

After the YTM tuning has been corrected and the bias mea- 
sured, the DCU restores normal modulation and operation to the 
output section. The rime it takes to perform this process varies 
somewhat with the amount of tuning needed, but is typically 5 to 
10 milliseconds. In return for this brief delay, the 8873A delivers 
more output power and better modulation. 



voltage source. This works well at constant power, but 
requires external adjustment if the power level is varied. As 
shown in Fig. 3. as the power level drops, the operating 
voltage should drop too. so a vol tag e- source bias network 
needs to be adjusted as the input power to the multiplier is 
changed. 

At low power levels there is not enough diode current to 
produce the required bias. The fixed bias voltage shown in 
Fig. 3 provides the needed bias on the knee of the SRD's dc 
characteristic. 

YIG Filter 

The YFG filter is central to the operation of the overall 
YTM. The operating characteristics that had to be dealt with 
were bandwidth, crossing modes, magnet alignment, out- 
of-band rejection, acoustic squ egging, and loop inductance. 

The shape of the coupling wires affects nearly all of these 



factors. The design uses tight coupling to maximize 
bandwidth and minimize loop inductance for narrow r 
pulses, This leads to more severe crossing modes which 
have been minimized experimentally by optimizing the 
sphere orientation. As Fig. 4 shows, the YIG filter 
passbands are free of spurious modes end the 3-dB 
band widths are typically 60 MHz or more. A 3-dB 
bandwidth less than 40 MHz adversely affects the rise time 
of pulsed RF signals. 

The parameters of the YIG sphere are: 

■ Saturation magnetization 4rrM^ = 800 gauss 

■ Diameter = 0.58 mm (23 mils) 

■ Operating temperature = 8G*C 

Bias field axis = 30° off (l00) towards (lio) 
The YIG filter is temperature stabilized with a heater and 
thermistor feedback control which maintains the sphere at 
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Fig. 4, YIG titter passband 
characteristics Horizontal scale 
40 MHzfdiv, Vertical scate; 3 d&i 

div. 
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80°C, independent of external temperature changes, 

Nearly all of the key specifications of the 8673A depend 
on whether the YIG filter is centered at the output fre- 
quency. An autopeak circuit (see page 12] controlled by the 
microprocessor ensures that the YIG filter is operating at the 
center of its passband. It works by making small corrections 
to the magnet current and monitoring the output of the ALC 
detector to find the maximum output power as a function of 
magnet tuning. Thus, output power is maximized and op- 
timum pulse shape is maintained in the face of magnet 
tuning hysteresis and nonlinear tuning characteristics 
caused by reactive circuit pulling of the filter center fre- 
quency. 

Pulse Modulation System 

For the YTM fcq frequency-multiply microwave input 
pulses without producing excessively long rise times, the 
diode bias voltage V(t) must reach its steady-state value in a 
time at least as short as the rise time of the input microwave 
pulse. Using the self-bias scheme described earlier, the dc 
bias conditions are created by the rectification inherent in 
the operation of the diode. Because of charge storage in the 
step recovery diode, the rise time of the bias voltage V(t] is 
about 1 00 to 300 ns. producing microwave pulse rise times 
of approximately the same length. 

The rise time of the output microwave pulse is longest at 
the high end of each multiplying band (e,g,, at 12>3 t 18.6, 
and 20.5 GHz). This occurs because reverse- recovery cur- 
rent of the 3RD flows for a higher fraction of each input 
frequency cycle at the high end of each band. Hence it takes 
longer For the SRD bias voltage V[t) to reach its steady- state 
value and the output pulse rise time is correspondingly 
longer. 

The method used to eliminate the rise time degradation of 
the YTM is to charge the capacitor Cl in Fig t 1 to the 
required final value before the arrival of the microwave 
pulse. In this way, V(t) is at the correct voltage when the 
pulse arrives, and the RF pulse passes through the YTM 
undistorted, Cl is charged by applying a short positive 
pulse to diode Dl before the RF pulse arrives, This pulse is 
called the YTM injected pulse, Using this technique, the 
microwave pulse rise time is limited only by the YIG filter 
bandwidth, 
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Fig. 6. Pulse injection forces the step recovery diode bias 

voltage to the correct value before the RF pulse arrives. Output 

rise ttme is then a function only of the YIG filter bandwidth. 

The successful implementation of this system rests on the 
ability to predict the correct amplitude of the YTM injected 
pulse at each frequency and power level. Since the purpose 
of the injected pulse is to raise the bias voltage V(t) to the 
value it would have in normal operation, the first step is to 
measure the steady-state value of the diode voltage while 
the YTM is in CW operation- Then this voltage is used to 
control the YTM injected pulse amplitude. 

One requirement for this system to work effectively is that 
the optimum injected pulse amplitude must be a linear 
function of steady - state SRD bias voltage, The proportional- 
ity factor and offset must he relatively constant over any 
multiplying band. A simple model of the circuit predicts 
and experimental data confirms that the required linear 
relationship exists and that the variation in slope and offset 
with frequency can be accommodated by adjusting the gain 
and offset of the circuit that controls the injected pulse 
amplitude, This adjustment is made by observing pulse 
shape on each multiplying hand while adjusting the in- 
jected pulse amplitude. 

Fig. 5 shows a graph of the optimum injection pulse 
amplitude as a function of the steady- state value of step 
recovery diode bias voltage V(th Different values of bias 
voltage correspond to different microwave power levels at 
the input of the YTM. It can be seen that different slope and 
offset values are obtained depending on the operating fre- 
quency, although the total variation is not large. Within one 
multiplying band the variations are small enough that the 
behavior over the entire band can be adequately approxi- 
mated by a single straight line. 

Since the straight line is only an approximation, the ac- 
tual RF pulse at a particular frequency can have either 
overshoot or lengthened rise time depending on whether 
the injected pulse amplitude is larger or smaller than the 
optimum value. The slope and offset parameters of the 
pulse control circuit that generated the curves in Fig. 5 are 
factory-set so that the total variation of rise time and over- 
shoot is within the specified limits of 20% maximum over- 
shoot and 35 ns maximum rise time. Typical performance is 
10% maximum overshoot and 25 ns maximum rise time. 

The pulse control system waveforms are shown in Fig. 6. 
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Fig. 7. An 18.6-GHz pulse (a) 
without and (b) with pulse injec- 
tion. Horizontal scdie: 50 nsidiv. A 
26.5-GHz pulse (c) without and (a) 
with pulse injection. Horizontal 
scale: 100 nsidtv, 



The timing of the application of the YTM injected pulse is 
critical and is fixed. The pulse arrives approximately 50 ns 
before the RF input pulse to allow ringing transients to die 
out of the SRD bias voltage V(t)* When these transients die 
out, the SRD voltage is at the correct value and the output 
microwave pulse rise time is limited only bv the bandwidth 
of the YIG filter, 

Fig. 7 shows pictures oFRF pulse shapes that demonstrate 
the dramatic improvement in rise time as a result of the 
pulse injection 

The overall pulse modulation control system* Fig, 8, op- 
erates as follows, Whenever frequency changes by 50 M 1 1/ 
or more, or power changes by 0,4 dB or more, the input RF 
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voltage to the YTM can change significantly. Thus the SRD 
bias voltage V[t) may change, which will change the re- 
quired YTM injected pulse amplitude, To compensate for 
these changes the microprocessor switches the leveling 
system into the CW mode far about 200 #s. During this time 
the microprocessor changes the DAC (digital-to-analog 
converter] output until it equals the steady- state value of 
bias voltage V(t). Pulse mode is then enabled and the in- 
jected pulse amplitude is again the correct value to produce 
short-risc-time pulses. 

The DAC output voltage drives an amplifier, which pro- 
vides a gain and an offset adjustment for each band. The 
amplifier output voltage controls the injected pulse 
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Fig. 8. 8673 A pulse modulation system, 
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amplitude. 

Another requirement in the reproduction of high-quality 
output pulses is that the RF pulse into the YTMmust be free 
of ringing and overshoot. This is accomplished by using a 
series-shunt pulse modulator beiore the power amplifier. 1 
This minimizes mismatch reflections between the preamp 
output and the power amplifier input. The series pulse 
driver provides a pulse to the series diode while the shunt 
pulse driver provides a pulse to the shunt pin diodes. This 
method achieves the low reflections of a series- shunt mod- 
ulator and retains the short rise time of a shunt pin diode 
modulator. 

Also included in the pulse modulation system is a pulse 
width detector which turns on the front-panel unleveled 
indicator when the input pulse width is less than 100 ns, 
The specified level accuracy at 100-ns pulse width is ±1 dB 
relative to the CW level. Typically, specified level accuracy 
is maintained down to 80-ns pulse width. Pulse widths less 
than 80 ns are available but level accuracy is degraded, 
.Maximum pulse repetition frequency for the specified level 
accuracy is 1 MHz, Typically, specified level accuracy is 
maintained to 5 MHz, as shown in the table below. 

Pulse Performance Summary 



Parameter 


Specified 
Performance 


Typical 
Performance 


Level Accuracy 


±i& dB 


+0.4 dB 2 


(relative to CW) 






Mini mum Pulse Width 


100 ns 


80 ns 


(for specified accuracy) 






Minimum Duty Cycle 


0.0001 


0.00003 


{for specified accuracy) 






Maximum PRF 


1 MHz 


5 MHz 


(for specified accuracy) 






Pulse On/Off Ratio 


>80 dB 


>9G dB 


Overshoot 


20% 
{25%, 6.6-6.7 GHz) 


10% 


Rise Time 


35 ns 


20 ns 1 
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1. Typical system performance gives rise time loss than 25 ns on thi> multiplying bands 
and 15 ns on the 1.95-6.6 GHz band. 

2. Typical teval accuracy tela live to CW ts HQ:4 dB at 100-ns pulse width. 
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Compact Digital Cassette Drive for 
Low-Cost Mass Storage 

This portable battery-operated unit uses minicassettes to 
store programs and data inexpensively for HP-IL systems, 

by William A. Buskirk. Charles W. Gilson, and David J. Shelley 



THE HP 82 161 A Digital Cassette Drive (Fig, 1) is a 
portable, programmable, mass storage peripheral 
for the Hewlett-Packard Interface Loop (HP-IL), 1 The 
storage medium is a removable minicassette that can store 
up to 128 K bytes of information. Portability is achieved by 
the use of a four-cell nickel-cadmium battery pack T re- 
charger, and power supply system similar to that used in 
other portable HP products. The 82 161 A is styled to fit in a 
family of compact peripheral devices such as the 82143 A 
and 82 162 A Printer/Plotters, and to fit nicely in a system 
controlled by an HP-41 Handheld Computer or an HP-75 
Portable Computer. The 62161 A makes use of much of the 
package design of the 82143A Printer/Plotter, 3 producing a 
unit 178 min wide by 133 mm deep by 57 mm high. Replac- 
ing the 821 43 A '.s printer mechanism on the top right side is 
a transport mechanism with a REWIND key and a door OPEN 
key located in front. To the left of these keys is the power 
switch and indicators POWER, LOW BATTERY, and BUSY. 
The top left side of the package offers a compartment to 
store two minicassettes. The two HP-IL cables and the re- 



charger cable are connected to the 82161 A via plug recepta- 
cles on its back panel. 

Electronic System 

Fig. 2 is a block diagram of the electronic system of the 
82161A. An internal microcomputer controls the head and 
motor drive electronics for the transport assembly and in- 
teracts with the HP-IL interface logic and data buffers. 

The criteria for microcomputer selection for the 821 61 A 
included low cost, ready availability, low power consump- 
tion, and adequate I/O. To limit the number of electrical 
parts in the 82161 A, a microcomputer that also contained 
ROM, RAM, and a timer, and could generate the encoded bit 
timing during a write operation was needed. A 3870 mi- 
crocomputer with 2K bytes of ROM and 64 bytes of RAM 
was selected. 

The logical interface of the 821 61 A is a generalized mass 
storage driver that provides the capability to execute opera- 
tions such as initializing the tape, seeking a record, reading 
or writing a record* and rewinding the tape (see Table I). 




Fig. 1. The HP 82 161. A Dtgttat 
Cassette Drive is a compact 
battery-operated mass-storage 
unit designed for use in portable 
HP-IL systems 
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Table I 
HP 82161A Digital Cassette Drive Commands 



DDLO 


Write buffer 


DDTO 


Read buffer 


DDL1 


Write buffer 1 


DDT1 


Read buffer 2 


DDL2 


Write 


DDT2 


Read 


DDL3 


Set byte pointer 


DDT3 


Read address 


DDL4 


Seek 


DDT4 


Exchange buffers 


DDL5 


Format 


DDT5 


Transfer buffer 0-*l 


DDL6 


Partial write 






DDL7 


Rewind 






DDL8 


Close record 






DDL9 


Transfer buffer 0— *1 




DDLIO 


Exchange buffers 







Buffer space for two 256-byte records of data is provided. 
Buffer is used for data transfers between the HP-IL and the 
minicassette tape, and buffer 1 can be used by the HIMI. 
controller as virtual memory. The Intent is to provide space 
to store a pa^e of the tape directory and thereby reduce the 
number of seeks to the directory at the beginning of the tape. 
The DDL3 [set byte pointer), DDL8 (close record], and UDLr 
[partial write} commands allow a memory-limited control- 
ler such as the HP-41 Handheld Computer to modify parts of 
a record without having to buffer the entire record in its 
mainframe, The record is read into buffer Q\ modified, and 
written back to the tape with only the modification informa- 
tion passing around the HP-IL. 

The ability to use the 82 161 A for extended remote data 
gathering tasks has been enhanced by the addition of the 
power-up/down commands. When the power switch on the 
front-panel keyboard is in the STANDBY position and a 
loop- power-down [PWRDN) command is received, the 
drive' s power supply is turned off. When the HF-IL control- 
ler requires the loop to be active again , it sends a string of 
identify message frames, which turns the drive's power 
supply back on. 

Software 

The 2K bytes of machine code in the ROM of t In-* mi- 
crocomputer can be divided into three major areas: the 



power-on idle routine, the HP-IL routine, and the device 
control routines, The power- on idle routine, which uses 
approximately 160 bytes oj code, sel.s up the initial stale of 
the 821 61 A at power on and then alternates between testing 
for a cassette to be insert m I i nl o t he drive, the REWIND key to 
be pressed, and calling I he HP-IL routine. This routine also 
executes the device-clear and power-down functions. If 
either Command is received, the I IP-1L routine flags that fact 
and the drive responds after it finishes its latest task and 
returns to the idle routine loop, 

The HP-IL routine, which takes approximately 460 bytes, 
provides the 821 61 A with basic talker and listener 
capabilities. This routine takes care of all communication 
with the HP-IL interface chip and passes all necessary in- 
formation to the device control routines, primarily through 
a set of flag registers and one data register. This polled 
solution to HP-IL, in contrast to an interrupt-driven solu- 
tion, is required because most of I he device control routines 
need exclusive use of the microcomputer and can only give 
up control at specific times. 

The device control routines, which take the remaining 
1420 byles of ROM, can be further divided. One part is the 
command decode portion, The device control is done with 
device-dependent commands (DDCs). When the HP-IL 
routine receives a DDC that it decides is of interest to the 
82161 A, it passes the DDC on to the command decode 
routine. Either the command is executed immediately, as in 
the case of a read or exchange buffer operation, or flags are 
set to control future actions, such as write and set byte 
pointer where the flags control where data bytes are put. A 
one-byte command buffer is used to hold a DDC received 
when the drive is busy. The HP-IL ready-f or- command 
(RFC) message frame following this command is not re- 
transmitted until the present task has been completed and 
the new command has been decoded. 

The DDL5 (format) command initializes the record posi- 
tions on the tape by recording all 51 2 records on both tracks. 
Each record contains a sync byte, a byte for the record 
number, a second sync byte. 258 bytes of data (each data 
byte initialized to 255), a checksum, and a final sync byte. 
Only during initialization is the first sync, byte and the 
record number written. In all following write operations, 
the record number is read before the remaining part of the 
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Fig. 2. Block diagram of elec- 
tronic system of the 821 81 A Digital 
Cassette Drive. 
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Fig* 3. Timing diagram for signal lines and one-shot multivi- 
brator states used to decode bit values stored on the 321 61 A 
tape cassette. 

record is written. This serves two purposes. The first is to 
verify that the proper re cord is being written and the second 
is to fix the record position on the tape so that it dues nol 
move along the tape when it is overwritten. 

Tbe major criterion in selecting an encoding method for 
the 82 161 A was reliability* The tape drive system requires 
that the method have a large speed- variation tolerance and 
use a microcomputer to generate the encoded bit stream 
during a write operation. The tape lengths required to re- 
cord a one and a zero should be the same so that the length 
of a record does not depend on the ratio of ones to zeros 
within the record. Also, the code should be self-clocking for 
easy decoding. 

The method best qualified is the biphase- level or Man- 
chester code. The rules of this code are 1) there is always a 
transition in a bit cell center, and its direction specifies the 
value of the bit, and 2) there is a transition on a hit ceil edge 
only when the two bits on either side have the same value 
[see Fig. 3). 

In a write operation, the time between transitions, bit cell 
midpoint to bit cell edge, is 64 ^s. During this time the 
transport status [stall, cassette present, and tmii of I ape} is 
checked, the next nibble is read from the buffer and added 
to the checksum, and the next transition is calculated. The 
bit stream generated is sent to the sense amplifier cm the DIO 
line. 

There are two signals used in a read operation, DIN [data 
in] and DRDY [data ready], DRDY is the extracted clock and 
DIN is the latched data derived from the signal read from the 
tape. The microcomputer reads DIN and DRDY simulta- 
neously and checks for DRDY to change state. When it does, 
the value of DIN is shifted into the register building the< fete. 
While it is in the read loop, the microcomputer also checks 
the transport stains, si ores the complete nibbles in the buf- 
fer, adds them to the checksum, and maintains a counter to 
detect when signal dropouts occur. 

When the read/write routine is entered, the motors are 
turned on. the record number is read and verified, and the 
data portion of the record is then read or written. If a record 
number error and/or (in the case of a read) a checksum error 
is detected, the drive attempts the reed or write operation a 
second time. II the microcomputer still detects an error, it 
stops the drive and reports the error to the HIM I, controller. 



Seek operations are always attempted in a relative man- 
ner first. When the new record number is received, it is 
checked to see t fit is in range (ie,. <51 2 J. and the difference 
between the present position and the desired position is 
calculated. The transport is turned on to move in the ap- 
propriate direction, and by watching the DRDY Hne. the 
microcomputer counts interrecord gaps until the transport 
reaches the record immediately before the desired record. 
This record is read, and the record number is checked I 
correct, the transport is stopped with the desired record 
next. If an error is detected, the tape is rewound, and the 
seek is attempted again, but this time from the beginning of 
the tape. This gives four chances of reading the record 
correctly and ensures accurate seeks. 

Data Storage and Retrieval 

The microcomputer handles digital information to and 
Irom the read write electronics on a bit-by-bit basis using 
three data-related lines (DIN, DIO, and DRDY. see Fig. 3) 
and two control lines (REC and TRK)~ DIO is a bidirectional 
data line whose transfer direction is controlled by the state 
of the REC line. In the read mode (REC low). DIO is driven by 
the sense amplifier. w r hile in the write mode (RKC high], the 
sense amplifier goes into a high-impedance state and DIO is 
driven directly by the microcomputer. Both DIN and DRDY 
are generated by the decoder circuitry and are derived 
solely from DIO level changes. The TRK line is driven by the 
microcomputer to select which tape track (0 or 1] is read 
from or written to. 

The sense amplifier is a custom bipolar integrated circuit. 
It contains the signal conditioning and logic circuits to 
drive the magnetic head during a write operation and to 
translate the low-level analog signals at the head to time- 
related digital signals at DIO during a read operation. 

Writing to the tape is accomplished by controlling the 
current flowing through the windings of the magnetic head. 
These currents produce a magnetic field across the gap at 
the front of the head. Three wires (two ends and a center tap) 
are attached to each winding [track] on the head. During a 
write operation, the center tap is connected to a constant* 

nt sink, ami each end of the winding is a Item,. 
driven high to control the direction of the current and thus 
Ihe polarity of the magnetic field at the gap. The use of a 
current sink allows maximum rate of change of current, yet 
limits the peak direct current to 150% of that required to 
completely magnetize the tape, 

During a read operation, the voltage at the terminals of 
the head is proportional to the rate of cha nge of t he magnet- 
ic flux across its gap and reaches a peak value when the gap 
is directly opposite a flux reversal on the tape, To decode 
recorded information properly, a digital signal with level 
changes corresponding in time to these voltage peaks must 
be generated. The sense amplifier generates this signal In 
amplifying and then differentiating the analog signal from 
the head. A zero crossing at the output of the differentiator 
i in esponds to a peak of the amplified signal and is used to 
clock DIO level changes. The DIO level (high or low) is 
related to the polarity of the amplified signal at i lock time 
and indicates t bed I rut tinn of the flux transition. Hysteresis 
'■ on hided to provide protection from unwanted mmsi- 
tions caused by electrical noise. 
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Data content is encoded by the direction of the DIO level 
transition at the midpoint of a bit cell. Transitions at bit eel] 
edges are used only as required to set up DIO for the proper 
change at the next midpoint [see Fig, 3J, The decoder 
hardware ignores these edge transitions and provides the 
microcomputer with two signals — DRDY and I JIN. A change 
at DRDY notifies the microcomputer that the signal at DIN 
represents valid data. 

Kor every DIO transition a 100-ns pulse is generated and 
appears at the trigger input of a nonretriggerable one-shot 
multivibrator, The timing period of the one- shot multivi- 
brator is set so that, if triggered by a midcell transition, the 
next edge transition, when it exists, will occur during the 
cycle and thus be ignored. When the timing cycle expires, 
the level of DIO, which corresponds to the encoded bit 
value, Is latched into the DIN Dip-flop, Approximately 2 /xs 
later t the output of the DRDY fl ip-flop changes, notifying the 
microcomputer that data is valid (see Fig. 3J. 

To ensure that the one- shot multivibrator is triggered 
only by midcell transitions and that it ignores any cell edge 
transitions, a sync byte is included at the beginning of each 
record. This special bit pattern maps to a stream of level 
changes that includes only midcell transitions. Thus T the 
one-shot multivibrator is set up to be triggered only at mid- 
cell, and, if speed stays within allowable limits, synchro- 
nization is maintained throughout the entire record. 

The encoder hardware can tolerate timing variations in 
DIO of ±30%. Electronic jitter, aliasing, phase shift in the 
amplifiers, external electromagnetic noise, and true speed 
variations all contribute to the total timing variance at DIO, 
Fortunately, the wide acceptability range of the decoder 
easily overcomes these factors and ensures good unit-to- 
unit compatibility. 

Transport Mechanism 

The 8216lA's mechanical design uses an 8 x 34 x 56 mm 
minicassette designed especially for digital applications. 
The cassette contains nominally 24 meters of usable tape 
3.81 mm wide, allowing two tracks of data 1.45 mm wide, 
This tape is hub-driven as opposed to capstan-driven, 
meaning that the onJy way the tape can be moved is by 
turning the appropriate stack of tape directly. 

The selection of a hub-driven cassette was the key step in 
a "simplicity" approach to the design of the 82 161 A 



mechanism. It allows the use of a two-motor drive [one per 
hub) and eliminates additional motors or controlled ac- 
tuator devices that would be required by capstan 
mechanisms or single-motor drives. Another key to 
simplicity is the use of a two-track magnetic head. This 
eliminates having to move the cassette, either by the user or 
by a mechanism, to access both tracks. Other factors con- 
tributing to a straightforward design are the extensive use of 
inject ion- molded thermoplastics, cost-effective fasteners 
such as adhesives and press fits, and a low part count. 

Aside from simplicity, another goal of this mechanism 
design was modularity. This produced a drive system mod- 
ule that can be removed from the 8 2 1 6 1 A a nd designed i nto 
other products with a minimum of change, electrical or 
mechanical, 

The primary part of this device is the head Ira me assembly 
[Fig, 4). This assembly consists of a molded plastic frame 
into which a magnetic head is aligned and glued, and an 
optoelectronic device which forms half of the beginning- 
of-tape/end-of-tape [BOT/EOT] sensor. The single-gap. two- 
track head's coil winding parameters were chosen for both 
read and write functions. The headframe is molded from 
glass-filled polycarbonate, a very stable compound that al- 
lows some key dimensions to be held within a tolerance of 
0.025 mm, Two posts on the headframe position the cassette 
housing relative to the head and two tape guides on the 
frame guide the tape relative to the head. These features are 
molded into the headframe. 

The headframe assembly is joined with a door, window, 
and cassette pressure springs to form a door assembly very 
similar to that used in many conventional cassette tape 
recorders. The cassette is loaded into a slot in the door and 
the action of closing the door positions the cassette in the 
mechanism. When the door is released by pressing the 
OPEN key, a spring pulls it open so that the cassette can be 
easily removed. 

VY hen the door containing a cassette is shut, it pushes a 
pin, closing a switch spring located directly on the drive 
printed circuit assembly and informing the electronics that 
a cassette is present. A 45-degree mirror located within each 
cassette completes the optical BOT/EOT detection path. The 
optoelectronic device and the head in the headframe are 
connected to the drive's printed circuit assembly by a flexi- 
ble ribbon cable. 
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Fig. 4. Exploded view of the 
82 161 A headframe assembly - 



20 HEWLETT-PACKARD JOURNAL MAY i983 



)Copr. 1949-1998 Hewlett-Packard Co. 



The backbone of the transport is the mainframe. It is a 
precision part made of glass-filled polycarbonate formed by 
injection molding. All of the key subassemblies, including 
the door headframe assembly, the motors and gear systems, 
the door latch, and the printed circuit assembly are fastened 
to the mainframe to form a complete modular transport 
mechanism. 

o identical drive motors are used. One drives the left 
stack of tape and is called the forward motor. The other 
drives the right stack and is called the reverse motor. The 
use of two motors coupled directly to the tape stacks in this 
fashion makes possible a simple speed control scheme for 
reading and writing data. The motor selection involved a 
tradeoff between motor performance [hence cost) and 
product capability { primarily data capacity), lronl ess-rotor 
motors with their low inertia torque ratio were selected. 
The software definition requires interrecord gaps long 
enough to allow the tape velocity to change between zero 
and read write speed between records. Selecting low- 
inertia motors allows record length to interrecord gap ratios 
of around 3:1. If higher-inertia motors had been used, this 
ratio, as well as data capacity, would have been lower. 
Another important characteristic of ironies s-ro tor motors in 
this application is their linear tachogenerator feature. The 
construction of the motors is such that their EMF, with the 
commutation ripple filtered out, can be used to detect motor 
speed changes typically w it hi n 2 percent. Perhaps the most 
important result of the selection of low-inertia motors is the 
reduction of dynamic tape tension. With the two-motor, 
hub-drive technique, the driving motor must pull the tape 
against the other motor's inertia, When accelerating, the 
trailing motor's inertia ts multiplied by the square of the 
gear reduction as it is reflected into the tape. Hence, peak 
tape tensions are very sensitive to motor inertia. Tape life 
was found to be d ominated by dynamic tape tensions and so 
the long tape life achieved in the 821 61 A is strongly related 
to the selection of low-inertia motors. 

Motors (i mi M not be found that would go slow enough 
while maintaining the torque required to drive the tape 
hubs directly. Therefore, a speed reducer is required. 
Many methods were considered, beginning wit h the logical 
choice of motor/gearbox combinations, but these proved to 
be too expensive, O-ring arid toothed-belt drive designs 
wen? tried, but both exhibited a common problem of requir- 
ing increased belt tension to avoid slipping. The higher belt 
tension produced higher shaft frii hich in turn, led to 

Increased tape tension and reduced tape and bearing lilY. 

The 82161A uses a custom gear drive (Fig, 5] consisting 
of a pinion and drive gear for each motor with a ratio of 1 :4 
[15 teeth to f>0 teeth]. Both gears are injection-molded at a 
custom gear-molding house and exceed AGMA* quality 
No, 7. The diametral pitch is 96 and the gear material is 
lubricated acetal resin, The pinions are pressed on the 
motor shafts and the drive gears run free on ground 
stainless-steel shafts pressed into the mainframe (Fig, 5a]. 
The motors are positioned k i out r?ntric collars fastened 
also to the mainframe. Precision molding of the mainframe 
allows the motor-to-drive-gear- shaft, dimension to 1m? held 
to within 0.025 mm. Also running on the drive gear shafts, 



and axially coupled to the drive gears, are the drive splines 
(see Fig, 5b). These splines fit into the cassette hubs and 
transmit torque to them from the driven gears. In the event 
that a cassette hub does not align with the spline when the 
door ts closed, the spline is spring-loaded and can be 
pushed down, allowing the door to shut When that particu- 
lar side of the drive moves, spline relative to hub, the spring 
pushes the spline up to engage it with the hub. These spline 
springs also produce a braking action or a drag friction, This 
drag was optimized for system speed performance by ad- 
justing the spring constant and preload, 

Head Alignment 

Accurate alignment of the magnetic head in the head- 
frame is crucial to give unit-to-unit read/write compatibility 
for systems using multiple transports. This alignment is 
done electrically, with the head actually reading signals 




Snubber 



Pinion Gear 



Drive Motor 



Drive Gear 



Spline Shaft 






Fig. 5. The tape drive system in the 82 161 A uses two identical 
motors, each driving a spring -loaded spline (a) Close- up 
photograph of drive gear and motor assembly, (b) Exploded 
drawing of motor and drive gear system. 
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from two tracks simultaneously, rather than aligning opli* 
Gaily as has been done by HP In the past. L4 The head aiigner 
ton] i (insists ul ;in endleSS-loop fege tim k, an oscilloscope. 
and anac voltmeter. A master head on the tape deck is used 
to write perfectly phased signals on both tracks of a tape 
loop. This loop is then read by a head requiring alignment 
The head is held inside a head frame by a tooling fixture that 
sets penetration and varies azimuth [perpendicularity of 
the head gap relative to tape motion] and tracking (vertical 
position of the head poles relative to tape position]. The two 
signals read are observed on the oscilloscope and the 
azimuth is adjusted until both tracks are in phase and the 
combination of signals with maximum amplitudes has been 
found. This ensures that the azimuth has been "perfectly" 
aligned in the fixture. Next, the tracking is adjusted by 
moving the head up and down until the sum of the signals 
from both tracks is a maximum as measured by the voltme- 
ter. This ensures that the pole spacing on the head opti- 
mally matches the track positions on the alignment tape. 
The two adjustments are practically decoupled on the 
alignment Fixtures, so convergence is not necessary . After 
the head is aligned, the assembly and tooling fixture is 
removed, and the head is glued in place with a fast-curing 
acrylic adhesive. The headframe assembly can then be re- 
moved from its fixture and the alignment rec becked to 
observe any movement caused by glue cure. This procedure 
produces azimuth alignment better than ±5 arc-minutes, 
and tracking alignment relative to the headframe t better 
than ±0,05 mm. The master head is periodically used to 
monitor the accuracy of the alignment tape. To check the 
master head, a Mobiles tape is installed on the tape deck, 
This tape alternately presents front and back sides to the 
master head. A signal is written on the front side and read 
from the back side. The amplitude is reduced, but only the 
track-to-track phase is important. If the master head is "per- 
fectly** aligned in azimuth, no phase difference will occur, 
An iterative process is used to align the master head if 
necessary. 

System Modeling 

The electromechanical system of the 82 161 A tape trans- 
port was modeled to allow studies of tape velocities and the 
effects thereon of motor parameters and mechanism inertias 
and frictions, The basic model is shown in Fig. 6 and the 
basic equations of motion are 

I 2 6 2 = -R2[K 2 (Rzfl z -Rifl0]4^[jyMfc-i(^ 

I 3 3 = -R4[K 1 (R4^-R 3 ^)] + ^2[K2(H 1 W4-H2^)] 

1#4 =-Ri[Kz[R 1 ^-R 2 tf 3 )] 

This multi-degree-of-freedom system was studied b}* 
identifying different mn tor-to-tape-stack and motor-to- 
motor modes whose frequencies depend on the reduction 
method (belts or gears] and tape stack ratios, Many of the 
natural frequencies found by this model can be satisfactor- 
ilv filtered out by altering the servo design, but one mode 
consistently showed up in the analysis that cannot. This 
was identified as a "tape mode" or the opposing oscillation 
of both hub stacks with the spring being the length of tape 




Motor 



0j = Left motor position 
tfj = Left stack position 

03 = Right stack position 

4 = Right motor position 
h = Motor inertia 

1 5 = L eft -hub-and-tape- stack inertia 
1 3 m Right- hub -and -tape-stack inertia 



Motor 



: Connecting tape stiffness 
Gear mesh stiffness 



R, = Pinion radius 



R 2 



Huh radius 
Left stack radius 
Right stack radius 



Fig. 8, Diagram defining tape motor drive parameters used to 
model 82161 A transport system behavior. 

between the two stacks. During read/write operation, when 
the servo is controlling tape speed, this tape mode, if ex- 
cited, is superimposed on the steady-state tape velocity and 
becomes what was found to be the major cause of tape speed 
jitter. In all of the configurations studied, this jitter fre- 
quency is near 500 Hz, This tape mode cannot be effectively 
corrected by the servo because it is totally isolated from the 
motors and hence is "invisible'' to the servo. This tape 
mode creates oscillations in tape tension, but the mag- 
nitude of the oscillations was found to be less than the 
steady-state tape tension. Thus, the treatment of the tape as 
a linear spring was not negated by the fact that the tape 
cannot be put into compression. This phenomenon will be 
discussed further in reference to the slow-start circuitry in 
the motor drive, 

The tape mode oscillations would not be a major diffi- 
culty if there was no 500- Hz excitation in the mechanism to 
get them going. Unfortunately, because of the size of the 
transport the gear pitch selection was limited, and 5 00- Hz 
perturbations caused by gear backlash are unavoidable, 
Hence, two methods are used to damp the tape mode. First, 
friction is added by using the spline spring as described 
previously, and second, viscous rubber smibbers are added 
between the drive gear and the spline [see Pig. 5). In this 
position, the smibbers serially provide a viscoelastic ele- 
ment between the perturbation (gear backlash) and the ele- 
ments (the cassette stacks) producing the tape mode oscilla- 
tions, The combination of these two damping schemes re- 
duces speed jitter by approximately 50% to a range of 10% 
of average tape speed. 
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Motor Drive 

The 821 61A mechanism combines software, electronics, 
and mechanics to control both the position and the velocity 
of the tape. "fTL-eompatibie inputs to the motor drive cir- 
cuitry allow the microcomputer to select any of five possi- 
ble modes of operation. 

The fast forward and rewind modes move the tape at 76 to 
152 em's, during which time the microcomputer counts 
Lnterreeord gaps to determine tape position (record 
number}. Once the desired position has been reached, the 
slow forward mode is activated for a data redd write opera- 
tion. Forward and reverse braking is accomplished by using 
the back EMF of the trailing motor to generate a reverse 
torque to decelerate the system. 

The fast forward, rewind, and slow forward modes use 
the leading motor as the actuator and the trailing motor is 
"pulled" by the tape. The no-load friction of the trailing 
motor and its associated gears provides tape tension to aid 
speed control and help keep the tape in contact with the 
magnetic head. The forward and reverse braking modes use 
the trailing motor as the actuator and the tape as the 
mechanical link to decelerate the leading motor. 

The heart of the motor drive electronics is the velocity 
control circuitry (Fig, 7), To ensure read/write compatibil- 
ity, linear tape velocity past the magnetic head must be a 
controlled, repeatable function of tape position. Although 
holding the angular velocity of one motor constant would 
satisfy this objective, tape capacity would be severely limit- 
ed because the linear tape speed would vary over a wide 
range as the radii of the takeup and supply reels, respec- 
tively, increase and decrease. However, hoi ding the sum of 
the angular velocities of both motors constant not only 
satisfies the above requirements, but dramatically increases 
data capacity by maintaining a more uniform linear tape 
velocity, 

The input to the servo is a controllable reference voltage. 
The servo acts to hold the sum of the back EMFs of the two 
motors equal to this reference. As shown in Kig. 7, the 
forward transfer path consists of an error amplifier, a power 
stage, and the mechanical system. Tin 1 Um k i A1K summer 
forms the feedback path. All necessary frequency compen- 
sation is implemented in the error amplifier and includes a 



pole at the origin to integrate out dc errors, a low-frequency 
zero at 4 Hz to compensate a pole of the mechanical system, 
and a second pole at =^40 Hz to filter out unwanted motor 
commutation noise that appears in the feedback signal. At 
frequencies within the range of interest (40 Hz}, the open- 
loop transfer function of the system, including compensa- 
tion, consists of a single pole at the origin. Local feedback 
for the error amplifier is derived from the output of the 
power stage to minimize crossover distortion, As discussed 
earlier, the transfer function of the mechanical system is 
quite complex and includes several oscillatory modes. For- 
tunately, these modes are either at frequencies well outside 
the bandwidth of the servo or are invisible to the servo so 
that no serious electronic stability problems arise. 

A novel feature of this servo is the speed sensor which 
sums the back EMF from each motor. Since no current flows 
through the trailing motor, the back EMF is simply its 
terminal voltage and Is readily available to determine motor 
speed. However, the current required to produce drive 
torque generates a voltage across the rotor resistance of the 
leading motor which is superimposed on its back EMF. In 
the past, this speed measurement problem has been avoided 
by using either pulse- width modulators, which sample 
back EMF by momentarily removing power, or transducers 
which do not rely on back EMF. For this application low- 
frequency pulse-width modulators would dissipate addi- 
tional power in the motor and generate electrical and 
mechanical noise caused by their switching transients. 
Transducers are too expensive, too large, and require too 
much additional hardware. 

The chosen scheme dynamically sums the terminal volt- 
age of each motor and subtracts the voltage caused by the 
drive currents in the leading motor. Referring to Fig. 7, 

V D = EMF L +I M R V| 

-[fEMF L +I M R M +I M R s )-(EMF L -hJ M R M l][R3/R2) 
+ [fEMl l -I N ,R M ]-(EMF L + l M R rir EMI- r )](R;i'Rl) 

M R I -R:i P and canceling terms. 

V„ = EMF^t M RuM M R s [R3;R2)+EMF T 



C1 



C2 



Iw 



Reference 



® 



R6 



Error 
Amplifier 
R5 and 

Power Stage 



V M 



.EMFl+MRm + Hs) SI 

,EMF L+ J tt (R M ) 




R2 



EMF L +WR M )-EMF T 




R3 @ R1 



Back-EMF Summer 



Mechanics 



^ 



Fig, 7, Simplified schematic of 
velocity control servo. 
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Then, if R2 is adjusted such that Rm/Rs=R3/R2« 

V =EMFl+EMFt 
As can be seen from the derivation above, if resistance 
matching is done [using potentiometer R 2 )« the output of 
the feedback amplifier is the true sum of the back EMF of the 
motors* Rg is specified as a copper wirewound resistor so 
that its temperature coefficient of resistivity will match that 
of the motor's ironless rotor, thus holding the ratio of R M to 
R5 constant and ensuring consistent speed control over the 
full operating temperature range. 

Hie fast forward and rewind modes are implemented by 
""fooling" the servo. Grounding point A in Fig. 7 eliminates 
the feedback and causes the output of the error amplifier to 
go high and drive the motor at high forward speed, Forcing 
point B low causes V to be high, thus forcing the output of 
the error amplifier low, This, in combination with closing 
switch SI and lifting the ground on the leading motor 
[using transistor switches), results in the rewind mode. 

To use the feedback from both motors to control speed, it 
is essential that the motors be mechanically linked in a 
predictable, linear fashion. In the 8216lA t this link is the 
tape, Because the tape cannot support compressive forces, 
slack in the tape can occur and totally uncouple the leading 
and trailing motors. The typical result is a "bang-bang" 
servo action. The leading motor is driven until its back EMF 
equals the reference value. Suddenly the tape slack is taken 
up and the trailing motor begins to move and injects a step 
function into the feedback signal. The error amplifier re- 
sponds by slowing the leading motor, which allows the 
trailing motor to spool up and form another loop, This starts 
the process all over again. 

Once the 82161 A has attained stable slow forward opera- 
tion, this problem is prevented by the tape tension gener- 
ated by system friction. However, when the slow-forward 
mode is initiated, there is always some amount ol slack in 
the tape, and this slack must be eliminated first before 
accelerating to full speed, In addition, since overshoot can 
cause the same problem, the rate of change of the speed 
reference voltage must be slowed to the point where the 
servo can keep up. 

The slow-start circuit performs these functions by con- 
trolling the reference voltage to the servo. When the slow 
fonvard mode is selected, the reference is held to a low 
value for approximately 130 ms r during which time the 
slack is removed from the tape. Then, the reference voltage 
rises exponentially towards an asymptotic value, allowing 
a smooth acceleration to read /write speed without over- 
shoot. 

Acknowledgments 

The authors would like to thank Roger Quick and Tom 
Braun for their direction and leadership in the development 
of the 82161A* and George Custer for the producl package 
design, head sourcing. and innumerable other design de- 
tails. Also greatly appreciated is the work of Mark Matsler 
in developing the head alignment process, and the hard 
work of everyone else who helped bring the 82161 A Digital 
Cassette Drive to production, 

References 

1 . R,D. Quick and & L. Harper. "HF-IL: A Lou -Cost Digital Inter^ 
face for Portable Applications." Hewlett-Packard Journal. Vol, 34 , 




no, h January 1983. 

2. R.D, Quick and D.L. Morris, "Evolutionary Printer Provides 
Significantly Better Performance/' Hewlett-Packard Journal, Vol. 
31, no. 3, March 1980, 

3. D J, Collins and B.C. Spreadbury, "A Compact Tape Transport 
Subassembly Design for Reliability and Low Cost/ 1 Hewlett- 
Packard journal, Vol. 31, no. 7, July 1980. 

4. R.B. Taggart, 'Designing a Tiny Magnetic Card Reader/' 
Hewlett- Packard Journal, Vol. 25, no, 5, May 1974, 



William A. Buskirk 

Bill 'Buzzy' Buskirk joined HP in 1977 
after receiving a BSEE degree from the 
University of Colorado. He worked in 
production engineering on various cal- 
culators before moving to R&D in 1978. 
Bill worked on the card reader for the 
HP-41 Handheld Computer and the 
821 61 A Cassette Drive before assum- 
ing his current responsibility as a 
project manager. He was born in 
Bloomington, Indiana and during his 
undergraduate studies, worked for the 
National Oceanic and Atmospheric 
Administration. Bill is married, has a 
son, and lives in Albany, Oregon Out- 
side of work, he is kept busy remodeling his home, reading, camping, 
working with home computers, and helping his wife run theirgiftshop. 



Charles W. Gilson 

Charlie Gilson graduated from Caiifor- 
nia Polytechnic State University with a 
BSME degree in 1 973, He worked three 
years on computer modeling of missile 
shock isolation and air-launch systems 
before joining HP in 1976. He worked on 
the mechanical design of various cal- 
culators and the 821 61 A Cassette 
Drive's transport mechanism before 
moving to production engineering to do 
cost-reduction design. Born in San 
Francisco, California, he now lives in 
King's Valley, Oregon. He is married 
and has two children, a girl and a boy. 
His interests include raising sheep and 
slowly remodeling an otd farmhouse. 



David J. Shelley 

Dave Shelley was born in Seattle, 
Washington and attended the nearby 
University of Washington, earning a 
BSEE degree in 1973. He then joined 
HFsSan Diego Division and contributed 
to the electrical design for the 7245A 
Plotter/Printer and the 9872A Graphics 
Plotter, In 1977. he transferred to HP's 
Corvallis Division and worked on the 
electrical design for the 82143A Printer 
and the 821 61 A Cassette Drive. 
Dave is currently a project manager for 
i: reduction of portable computers. 
He is named coin venter for a patent 
related to the thermal head design for 
the 7245A. Dave is married, has two sons, and lives in Corvallis, 
Oregon, He enjoys bicycling P camping, and playing racquetball. 





24 HEWLETT-PACKARD JOURNAL MAV 19B3 



)Copr. 1949-1998 Hewlett-Packard Co. 



Scientific Pocket Calculator Extends 
Range of Built-in Functions 

Matrix operations, complex number functions, integration, 
and equation solving are only some of the numerous 
preprogrammed capabilities of HP's latest scientific 
calculator , the HP-15C. 

by Eric A. Evett, Paul J. McClellan, and Joseph P. Tanzini 



THE NEW HP-15C Scientific Programmable Calcu- 
lator [Fig. 1] has the largest number of prepro- 
grammed mathematical functions of any handheld 
calculator designed by Hewlett-Packard, For the first time 
in an HP calculator, all arithmetic, logarithmic, exponen- 
tial, trigonometric, and hyperbolic functions operate on 
complex numbers as well as real numbers, Also, built-in 
matrix operations are provided, including addition, sub- 
traction, multiplication, system solution, inversion, trans- 
position, and norms. 

The HP-15C also performs the SOLVE and / functions, 
which are very useful tools in many applications. The 
SOLVE operator numerically locates the zstos of a func- 
tion programmed into the calculator by the user. 1 The 
L operator numerically approximates the definite integral 
of a user-programmed function, 2 



Design Objectives 

The HP-15C was designed with the following goals in 
mind: 

■ Provide all functions of the HP-llC and HP-34C Calcu- 
lators in the same slim-line package used for the HP-llC 
i Provide additional convenient, built-in advanced 
mathematical functions which are widely used in many 
technical disciplines, 

Achieving the first objective posed a keyboard layout 
problem. The nomenclature for the IIP- 1 1C functions filled 
every position on the keyboard* Since the display is limited 
to seven- segment characters, functions could not be re- 
moved from the keyboard and accessed by typing the func- 
tion name as is done on the HP-41 Handheld Computers, 
Therefore, to free some space on the keyboard, only the two 
most common conditional tests are placed on the keyboard, 
x=tQ and x^y, A TEST prefix is added to access the other ten 




H M W L 



• « 



PA C K 



Fig, l. The HP-15C is an ad- 
vanced programmable calculator 
with special functions that enable 
the user to solve problems involv- 
ing matrices, integrals, complex 
arithmetic, and roots of equations . 
Its slim-line design fits easily in a 
shirt pocket. 
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tests by executing TEST 0, TEST 1 TEST 9. A table on the 

back of the calculator indicates the correspondence be- 
tween the digits and tests. This frees enough positions on 
the keyboard to add the SOLVE and J functions, plus a few 
more. 

In striving to attain the second objective, we noted that 
nearly every text covering advanced mathematics for sci- 
ence and engineering includes chapters on complex analy- 
tic functions and matrix algebra. They are fundamental 
tools used in many disciplines, Furthermore, the complex 
functions and many of the matrix operations can be viewed 
as extensions of the functions already on the keyboard. This 
is an important consideration because of the limited 
number of key positions available. Thus, our goal was to 
extend the domain of some of the built-in functions to 
complex numbers and matrices in a natural way without 
altering how those functions operate on real numbers. 

Complex Mode 

A complex mode was introduced in which another regis- 
ter stack for imaginary numbers is allocated parallel to the 
traditional register stack for real numbers (Fig. 2). Together 
they form what is referred to as the complex RPN* stack, 

The real X register is always displayed. A complex 
number a+ib is placed in the X register by executing a. 
ENTER, b, I. The user may display the contents of the imagi- 
nary X register by executing Re^lm to exchange the con- 
tents of the real and imaginary X registers. Or the user may 
hold down the (I) key to view the imaginary part without 
performing an exchange. 

ENTER, flj,. Rf , x%y : and LST x all operate on the complex 
stack, but CLx and CHS operate only on the real X register so 
that one part of a complex number can be altered or com- 
plemented without affecting the other. For example, the 
complex conjugate is performed by executing Re^lm, CHS, 
ReSlm. 

The following functions include complex numbers in 
their domain: +. -, x, +, 1/x, V^x. x*. ABS [magnitude]. LN, 
e\ LOG, 10 x y K , SIN, COS, TAN, SIN" 1 , COS" 1 , TAN"\ SINH, 
COSH. TANH, SINH" 1 , COSH" 1 , and TANH" 1 . These functions 
assume the complex inputs are in the rectangular form, 

Often complex numbers are expressed in polar form: re 
== r{cos# 4- isinfl]. In complex mode* the polar-to- 
rectangular conversion functions -*P and -*R provide a 

•Ffc&vfttse Pr:!!^!- nO&fti$ri S togic system thgf ekmingt&s the need for parentheses and: 
■equals" keystrokes n calculator ODeralicms 
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Z t =3-j4 

Z Z =T0 

Z«,=1/(1/Z 1+ 1/Z 2 ) 

Fig. 3, The complex arithmetic capabilities of the HP-15C 
make it easy to solve for the equivalent impedance of this 
parallel circuit (see text). 

convenient means for converting between the polar and 
rectangular forms of a complex number. 

Complex numbers are used extensively in electrical en- 
gineering. For example, to find the equivalent Impedance 
in the parallel circuit shown in Fig, 3, perform the following 
steps on the HP-15C: 



Keystrokes 




Calculation 


3 ENTER 4 


CHS I 


Zi 


1/x 




rm% 


1 




%2 


1/x 




iMi 


+ 




HZ 1 + 1!Z 2 


l/x 




Z^-l/tl/Z^l/Z^, 
Hold down (I) key to 
view imaginary part. 
Z eq =2-9730 - 2.1622J 



Fig. 2. To handle complex numbers • the HP-15C uses 
another register stack in parallel with the traditional RPN stack. 
Only the contents of the X register in the real stack are dis- 
played. 



-*F Convert to phasor form. 

Z eq - 3.6761 Z. -36,0274 c 

This example is a very elementary application of the 
built-in complex function capability. Since complex opera- 
tions can be used in conjunction with the SOLVE and J" 
I'll net ions, the HP-l.5Ccan be programmed to carry out some 
sophisticated calculations such as computing complex line 
inlegrals and solving complex potentials la determine 
equi potential lines and streamlines, 3 

Matrix Descriptors 

No set of matrix operations is complete without addition, 
subtraction, multiplication, system solution, and inversion. 
To provide these operations on the HP-15C. it seemed 
natural to extend the domains of the +, - t x. +. and 1/x 
functions to include matrix arguments. Since these func- 
tions operate on the stack contents, a means of placing a 
matrix name (descriptor] on the stack is essential. The set of 
alpha characters that can be represented in a seven-segment 
font is limited, but the letters A. B. C t D, and E have reason- 
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RbCdE 



Fig, 4. The seven-segment font used m the HP-75Cs fiqutd- 
crystal display allows representations of the alphabetic 
characters A, 8, CD, and E as shown above for use m labeling 
ces. 

able representations [Fig. 4). 

Thus the decision was niade to allow up to five matrices 
to reside in memory simultaneously* named A< B t C, D, and 
E. Their descriptors axe recalled into the X register by the 
sequence RCL MATRIX followed by the appropriate letter. 
Wh en the X register contain sa matrix descriptor, the matrix 
name and dimensions are displayed. Matrix descriptors 
may be manipulated by stack operations and stored in regis- 
ters just like real numbers, and certain functions accept 
matrix descriptors as valid inputs. For example, suppose C 
and D are 2-by-3 and 3-by-4 matrices, respectively, which 
are already stored in memory. To compute the matrix prod- 
uct CD and place the result in matrix A. the user parallels 
the steps required for real number multiplication, except 
that the result destination must be specified: 



Keystrokes 

RCL MATRIX C 

RCL MATRIX D 
RESULT A 



HP-15C Display 

C 2 3 

d 3 4 

d 3 4 



At this point, the HP-lsC's KPN stack contains the informa- 
tion shown in Fig, 5a. The matrix operands are in the stack, 
and the result matrix is specified. The user now executes x 
to compute the matrix product. When x is executed, the 
presence of the matrix descriptions in the Y and X registers 
is detected, the matrices are checked for compatible dimen- 
sions, the result matrix A is automatically dimensioned to a 
2-by-4 matrix, the product is computed, and the matrix 
descriptor of the result is placed in the X register and dis- 
played (Fig. 5b), 

The operators + and - work similarly, and ■#■ performs the 
matrix operation X _1 Y if the X and Y registers contain 
matrix descriptors. This is useful for linear system solution, 
since the solution to the matrix equation XR = Y is 
R-X _1 Y. The 1/x function key performs main 
inversion, 

Other important matrix operations that are not natural 
extensions of functions on the keyboard are accessed by the 
prefix MATRIX followed by a digit. These include transpose, 
determinant, and matrix norms, A table on the back of the 
calculator indicates the correspondence between the digits 
and matrix operations. 

Internal Format of Descriptors 
Normal lloal i'n^-poini numbers are internally rep- 



resented in the HP-15C by using a 14-digit (56-bit) binary- 
coded -decimal (BCD) format (Fig. 6 J, 

The exponent e is given by XX if XS = Q\ and by 
-( 100— XX) if XS=9. The value of the number is interpreted 
as ("l) S (.M.MMMM\iMMMM)xiO e . For example. 

123400000000 2 represei ■ 1 2 



and 



91234000000994 represents -1.234x10" 



Matrix descriptors, on the other hand, are distinguished 
by a l in the mantissa sign digit and a hexadecimal digit 
corresponding to the matrix name in the most significant 
digit of the mantissa field. For example, the matrix descrip- 
tor C is represented internally as 1C00Q0QQ0QQO00. 

When a matrix descriptor is detected in the X register, the 
matrix name is displayed, and the current dimensions of 
that matrix are fetched from a system memory location and 
also displayed. 

Creating Matrices and Accessing Individual Elements 

A matrix is dimensioned by entering the row and column 
dimensions in the V and X registers of the stack, respec- 
tively, and then executing the DIM prefix followed by the 
matrix name. Individual matrix elements are accessed by 
executing the STO or RCL prefixes followed by the matrix 
name. The element accessed is determined by the row and 
column indexes stored in registers RO and Rl , respectively. 
Matrix data is usually entered or reviewed from left to 
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Fig, 5. Before multiplying matrices CandU, the information in 
the RPN stack is located as shown m (a) After multiplication, 
the result matrix A, is located as shown in (b) and the LSTx 
register contains the information for matrtx D 
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right along each row and from the first row to the last. To 
facilitate this process, a user mode is provided In which the 
indexes are automatically advanced along rows after each 
STO or RCL matrix access operation. After the last element 
of the matrix has been accessed, the indexes wrap around to 
1,1. As an added convenience, executing MATRIX 1 ini- 
tializes the indexes to 1,1. 

The following example illustrates some of these features 
by solving the following matrix equation lor C; 



, Mantissa Sign 



Exponent Sign . 



1 * 



Ten -Dig it Mantissa 



Two-Digit 
Exponent 



+: S=0 



+ : XS=0 
-: XS=9 



Fig. 6. The internal representation for floating-point numbers 
in the HP-15C uses a 14-digit (56-bit), binary-coded-decimal 
format. 



p 5 -2i rcd.i] c(i.2)i r 8 3 1 

|_ 4 6 j Lc(2.1) c(2,2)J ^ 2 -ej 



Key: 


stroke 


5 


Display 


Comments 


USER 








Select USER mode. 


MATRIX 


1 






Initialize indexes in 
registers K0 and 
Rl to 1 


2 ENTER DIM A 


2,0000 


Dimension matrix A to 










2 by 2. 


DIM B 






2.0000 


Dimension malrix B to 
2 by 2. 


5 STO 


A 




5.0000 


a(l r l] 


2 CHS 


STO 


A 


-2.0000 


a(1.2j 


4 STO 


A 




4.0000 


mm 


6 STO 


A 




6.0000 


a (2, 2). Indexes wrap 
around to 1,1. 


B STO 


6 




8.0000 


HiM 


3 STO 


B 




3.0000 


b(l,2) 


2 STO 


B 




20000 


mm 


6 CHS 


STO 


B 


-H.O0O0 


bl2.2|. Indexes wrap 
around to 1,1, 


RCL MATRIX 


B 


b 2 2 


Recall right-hand side 


RCL MATRIX 


A 


A 2 2 


Recall coefficient matrix 


RESULT 


C 




A 2 2 


Specify matrix C as result 


* 






C 2 2 


Compute C-A" 1 B> 


RCL C 






1.3684 


6(14) 


RCL C 






0.1579 


0(1,2) 


RCL C 






-0.5789 


c[2,l) 


RCL C 






-1.1053 


c(2,2) 



Available Matrix Memory. Speed 

A maximum of 64 matrix elements can be distributed 
among the five matrices. Since the HP-15C can invert ma- 
trices in place, up to an 8-by-8 malrix can be inverted. There 
is also enough memory to solve a 7-by-7 linear system of 
equations, Table 1 specifies the approximate time required 
to perform certain matrix operations. 



Table I 
Time in Seconds for Selected Matrix Operations 



Order of 
Matrix 

1 
2 

3 
4 
5 
6 
7 
8 



Determinant 

0,5 
1,3 
2.8 

5.3 
9,1 

14 
21 
30 



Solving a 


Matrix 


System 


Inversion 


(hfi 


0.5 


2.0 


1.8 


4.2 


5,3 


7.6 


12 


12 


22 


19 


36 


28 


55 


— 


80 



Designing the Complex Function Algorithms 

After deciding to extend the real- valued functions and 
the RPN stack to the complex domain, our next step was to 
design the algorithms for complex arithmetic, Although 
their defining formulas are very simple, some disturbing 
examples made us question what accuracy should be 
achieved to parallel the high quality of the real- valued 
functions. 

The real functions are generally computed with a small 
relative error (less than 6 x 1 ~ 1 u ) except at part icular points 
of certain functions, where it is too costly in execution time 
or ROM space for the result to be computed that accurately. 3 
The relative difference R[x,y) between two numbers x and 
y is given by 



R(x.y] - 



x-y 



|y| 



When X is an approximation of x, then we say R(X,x} is the 
relative error of the approximation X. Notice that the size of 
the relative error is related to the number of digits that are 
accurate. More precisely, R(X*x) <0,5xi0~ n implies that X 
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is an approximation to x that is accurate to n significant 
digits. 

If we always wish to obtain smal! relative errors in each 
component of a complex result, then the outcome of the 
following example is very disappointing. For simplicity we 
will use four-digit arithmetic, instead of the 13 digits used 
internally to calculate the 10~digit results delivered to the X 
register of the calculator. 

Example 1: Using the definition for complex multiplica- 
tion, 

[a + ib][c + id] = [ac - bd\ + (ad + bc)i. 

consider the four-digit calculation of Z x W, where 
Z = 37.1 + 37.31 and W = 37.5 + 37.31 We get, 

Z X W = (1391 - 1391) -+- (1384 4- 1399)i 
= + 27831 

Since the exact answer is -0.04 + 2782. 58j, it is clear that 
accurate components are not always achieved by a simple 
application of this formula. The difference oxc— bxd has 
been roun ded of f to resu 1 1 i n a loss of all s igni f scant d igits of 
the real part. The loss can be eliminated, but the calculation 
time would increase roughly b} r a factor of 4. Is it really 
worth this higher cost in execution time? For comparison 
we will consider an alternative definition of accurate com- 
plex results. 

Complex Relative Error 

As with real approximations we often want our errors 
small relative to the magnitude of the true answer* That is to 
say, we want | (approximate value) -(true value) j,' | (true 
valuc)| l0 be small enough for our purposes, So relative 
error may be extended to the complex plane by R(Z T z] = 
\Z — 2 1 1 \z | , This extension may be applied to vectors in any 
normed space. A simple geometric interpretation is illus- 
trated in Fig. 7. Approximations Z of z will satisfy R(Z,z) <5 
if and only if the points Z lie inside the circle of radius S [ z j 
centered at z. This condition for complex relative accuracy 
is weaker than that for component accuracy. If the errors in 
each component are small, then the complex error is small. 
To show this, assume that R[X,x) < fi and RjY.y) < 5 where 
z=x+iy. Then, 

R(Z,z)- {X-x] + i(Y-y)|/|z| 

^ X -x|/(z| + |y-y|/|z[ 

^R(X,x) +R(V.y) 

<28 

Actually, R[Z r z) is less than 8, but this is slightly more 
difficult to show. On the other hand, however, a small 
complex error does not imply small component errors. Re- 
ferring back to Example 1, we see thai R|ZIV\ zw]= 0.000 2. 
which is respectably small for four-digit precision, even 
though the real component has no correct digits. 

It is not unusual for only one component to be inaccurate 
when the result is computed accurately in the sense of 
complex relative error, In Fact, because the error is relative 
to the size |z | . and because this is never greatly different 
from the size of the larger component, only the smaller 
component can be inaccurate. 




-«l<l 



Fig. 7. A simple geometric representation of complex relative 
error R(Z : 

To show this we shall assume, without loss of generality, 
that |x| is the larger component. Then 



|z|/|x| = V 1 + 



ylx\ 



which implies that l=s |z/x| «S V 2, since \y\ ^ |xj by 
assumption. Thus |x| and |z| do not differ greatly. The 
important part Is that |x| 5* |z|/V~2? This gives 

|X-x|/|x| *£ |Z-z|/|x| ^ Vz\Z-z\i\z\ < V2R(Z,z) 

So the relative error of the larger component (assumed to be 
x here) is very nearly as small as the complex relative error 
bound R(Z,z). It also follows that the smaller component is 
accurate relative to the larger component's size (i.e., 
[Y-y|/|x| ^ |Z-z|/|x| ^V~2R(Z,z). 

This provides a quick way to determine which digits of a 
calculated value can possibly be incorrect when it is known 
that the calculated value has a certain complex error, By 
representing the smaller component with the exponent of 
the larger component, the complex error indicates the 
number of correct digits in each component. 

For instance, in Example 1 we obtained the approxima- 
tion Z = + 2783i of the true answer -0.04 + 2782. f>8i. 
Since the larger component is 2.783x10 we will represent 
the first component with the same exponent (0,000 x 10 ) to 
obtain Z =0000,0 +2783i. These components must be accu- 
rate to nearly four digits since R(Z,z) = 0.0002. 

Perhaps the zero component of Z confuses the issue here, 
so another example may be appropriate, First, let 



Z = 1,234567890X10 



■10 



2.222222222X10 



-3; 



Then think of Z as 

Z = 0,0000001234567890X10" 3 + 2.222222222 xiO~ 3 i 

If the complex relative error indicates 10-digit accuracy, 
i,e. t R(Z,z) < 0.5X10 H ! then this implies that the first 10 
digits are correct, that is, 

Z = 0,000000123x10"^ + 2,222222222X10~ 3 ] 
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Error Propagation 

We have seen t hat computing the product of two complex 
numbers in the straightforward manner does not necessar- 
ily result in a small error in each component (Example 1), 
However it can be shown that the product does have a 
complex relative error bound of roughly 10 ~ n whenever n 
digits of precision are used in the calculation. Moreover, 
small relative errors in the input values give rise to relative 
errors nearly as small in the output values. This is not true 
for small component errors. One acceptable rounding error 
in an input value may produce an inaccurate component, 
even when the multiplication is exact. This is illustrated by 
the following example. 

Example 2: Let z = (1 +■ 1/300) + i and w = 1 + i, then 
using four- digit precision we have 



Z = 1.003 f l.OOOi 
and W = 1.000 + l,000i 



Therefore, 



ZW = 0,003 + 2-0031 



discussed further in the following section. 

In general, the HP-15C delivers complex results that 
satisfy R(Z,z) <6x 10~ 1D , except where functions involving 
trigonometric calculations (in radians) are evaluated at very 
large arguments or near transcendental zeros such as mul- 
tiples of it. This inaccuracy is embedded in the real-valued 
functions and is an example of an error that is too costly to 
correct completely. 3+4 

Some Specific Complex Functions 

For complex arithmetic we obtained accurate results (i.e., 
small complex relative errors) from the standard formulas 
used to define each operation. But, in general, defining 
formulas are usually not accurate for computers, In this 
section we will single out two particular functions, sin(z) 
and V*I, and very briefly focus on some difficulties that 
arise, 
■ Sin(z). A typical defining formula for the complex sine 

function is given by 



,-Jz 



= 3.000 X 10" 



2.0031 



sin(z) 



2i 



(1) 



exactly, yet 



zw =3.333 X 10 " 3 + 2.003] 



to four digits. The single rounding error of 1+1/300 
— * 1*003 in the component of the input Z was magnified 
from a relative error of 0.0003 to 0.1. 

So, in general, computing accurate components will not 
improve the result of a chain calculation because inter- 
mediate input values are often inexact [this is the idea of 
backward error analysis and is explained more fully in 
reference 3), It is important to realize that this is not, in 
itself, a good reason to forsake accurate results based on the 
assumption that the input values are not exact. For exam- 
pie, if we assume that X has an error in its eleventh digit and 
thus decide that sin(X) for X> 10 5 degrees, say, need not be 
computed accurately, then we would have failed to provide 
a useful result for those special cases where we know that 
the input value is exact. 

As a simple illustration consider accurately calculating 
the value sin(l, 234,567,899, 1234567890) where the argu- 
ment is in degrees. Using 

sin( 1,234,567,899) = 0.9876883406 

is grossly Inaccurate. Instead, let x = 1*234567899x10* and y 
= 0.123456789. then evaluate 

sin(x+y) = sin(x)cos(y) + cos(x)sin(y). 

Here we know x is exact, and since sinfx) and cos(x) are 
computed accurately by the HP-15C, the final result 
sin(x+y) = 0,9873489744 is very accurate. 

The point here is that clean results (in particular accurate 
components] are desirable, but in our estimation the cost of 
adding ROM and increasing execution time was too high on 
this machine to provide complex arithmetic that is accurate 
in each component. However, accurate components are de- 
livered in those functions where it is more practical. This is 



[t this is used to compute sin(z) for small |z| t the two 
exponential terms will be nearly equal and thus cause a 
loss of accuracy. This will result in a large complex 
relative error even though each step of the calculation is 
very accurate. If equation (1) is replaced by 



sin z = sin(x) cosh(y) + i cos(x) sinh(y) 



(2) 



where z = x+iy, then the relative error problem for small 
|z | will be solved, and furthermore the components will 
become accurate (except for the trigonometric difficulty 
with large angles mentioned earlier), To observe the 
striking difference in results, we calculate 

w = sin (L234567X10" 5 + 9,876543 Xl0~ 5 i] 

for each formula. The outcome is represented below. 



Eqn. 



W (10-digit calculation of w] 



(1) 1,234567006 xlfj' 5 + 9,876530000 xl0" 5 j 



R(W,w) 
10 - e 



12) 



1.234567006X10"= + 9.876543015x10"^ 10 



The HP-!5Cs internal calculation is based on equation 
(2). with minor modifications that exploit the relation- 
ships between the real functions to eliminate redundant 
computation. 

VT. The most common definition of the principal square 
root is 



V z =V |z| e 



f#2 



(3) 



where & is the Arg [z} f satisfying -7t<Q^tt. 

This formula is accurate with, respect to complex relative 
error but not accurate in each component. This can be seen 
by working through the calculation of V^, where a = — 1 + 



(-ixio liJ i), with 10-digit precision. Here 812 rounds to 
precisely 90 degrees, thus causing V a~^G— i . while the true 
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value rounds to 5 x 10 ~ 15 — i . The complex error is small but 
certainjnfoimatioD in the real component is lost, The fact 
that V a lies on the right side of the imaginary axis can be 
critical when computing near discontinuities called branch 
cuts. For example, ln( — i\ a) ■= -iufi, but the inaccurate 
component of V a will cause it to evaluate to iid2 since 
q is near the branch cut of In(zJ, More will be said about 
branch cuts in die nexTsection. 

It turns out that V z can be computed with accurate 
components and without loss in execution time. This func- 
tion, along with the Inverse trigonometric and hyperbolic 
functions, is computed on the HP-15C with accurate com- 
ponents. Their algorithms are not described by a simple 
formula as with sin(z) in equation (2), but rather are de- 
scribed in terms of their components. These accurate com- 
ponents are achieved by recognizing and eliminating errors 
such as those described above. 

Principal Branches 

The function V z is an inverse function of f(zj = z 2 . As is 
often the case with defining inverses, we must select from 
more than one solution to define the principal branch of the 
inverse, This is done for the real function by selecting the 
non-negative solution of x 1 = a and denoting it by V"a. 
Because of the branch point at G\ any branch for >/a must 
have a discontinuity along some slit (branch cut). In equa- 
tion (3) above, it is along the negative real axis. Notice in 
Fig. 8 that values below the negative real axis map to values 
near the negative imaginary' axis, while above the slit, val- 
ues map near the po sitiv e imaginary axis. Since it is tradi- 
tional to have i = VI we must attach the slit (negative 
real axis] to the upper half plane, making it continuous from 
above and not from below, that is t -n<9^ir. One will 
occasionally see V z defined forO^0<27r, which places the 
discontinuity along the positive real axis. We have avoided 
doing something like this in the branches of all of the 
complex inverse functions so that each will be analytic in a 
region about its real domain. This is important since com- 
plex computation is often perfotmed in a region about the 
real domain in which the function's values are defined by 
the analytic continuation from the real axis. 

The placement of the branch cuts andj.be function values 
along the slit are fairly standard for V z and ln(z), but the 
inverse trigonometric and hyperbolic functions have not t as 
yet, become standardized. However t by following a few 
reasonable rules there is not much room for variation, 

The first rule, analyticity about the real domain, has al- 
ready been mentioned. Secondly, we have tried to preserve 
fundamental relationships such as the oddness or evenness 
of functions (e.g«,sin(-z) = -sin(z])and the computational 
formulas relating functions to the standard principal 
branches of Infz) and \fz (e.g., tt/2 -si n[z) = g[z) V 1 - z 
where g[z] is analytic; at 1 , that is, a power series inz-1). 

The determination of formulas involving a choice of 
branches is often quite complicated. W*M, Kahan has pre- 
sented a very enlightening discussion 5 of branch cuts and 
has pointed out to us that the HP-15C branch cuts should 
s,3 r i sty certain simple formulas relating them to the princi- 
pal branch of Jn(z). These formulas are satisfied and are 
reproduced below. 



and 



Infz) - ln(|s|)+iAig(z) 

\ z = expfln(z)2) 



with —n < Arg(z) ^ it and VO =* 

acctanh(z) = [infl + z) - ln(l -zj]/2 
= -arctanhf— zl 

arctan(z) = — i arctanh(iz) 
= -arctan(-z) 



arcsinh(z) = ln(z + Vl +z 2 ) 
= -arcsinhf— z) 

arcsin(z) = — i arcsinh(iz) 
= -arcsin(-z} 

arccos(z) = W2 - arcsin(z) 

arccosbfz) = 2 ln[V(z + 1)12 + V(z - l]/2 ] 

These are not intended as algorithms for computation, but 
as relations defining precisely the principal branch of each 
function. 

Matrix Calculations 

As mentioned earlier, the HP-15C can perform matrix 
addition, subtraction, and multiplication. It can also calcu- 
late determinants, invert square matrices, and solve sys- 
tems of linear equations. In performing these last three 
operations, the HP- ISC transforms a square matrix into a 
computationally convenient and mathematically equiva- 
lent form called the LU decomposition of that matrix. 

LU Decomposition 

The LU decomposition procedure factors a square matrix, 
say A, into a matrix product LU. L is a lower-triangular 
square matrix with Is on its diagonal and with subdi agonal 
elements having values between -1 and 1. inclusive. U is an 
upper-triangular square malrix. The rows of matrix A may 
be permuted in the decomposition procedure. The possibly 
row-permuted matrix can be represented as the matrix 





I ¥ 


* 




-~ *K--__^ - _^ 




+—■— 




H 




~*~~ 


r— A 


(a)Z=w 2 


{b)w=Vz~ 





Rg. 8. The complex function Z - w 2 , shown in (a) has an 
inverse function w = \fz> shown m (b), which maps the Z 
plane onto the right half plane of w with a branch cut along the 
negative reai axis of the Z plane. 



MAY 1963 HEWLETT-PACKARD JOURNAL 31 



)Copr. 1949-1998 Hewlett-Packard Co. 



product PA for some invertible matrix P. The LU decom- 
position can then be represented by the matrix equation 
PA = LU or A = P _1 LU. 

The HP-15C uses the Doolittle method with partial pivot- 
ing to construct the LU decomposition. It constructs the 
decomposition entirely within the result matrix. The 
upper-triangular part of U and the subdiagonal part of L are 
stored in the corresponding parts of the result matrix. It is 
not necessary to save the diagonal elements of L since they 
are always equal to 1. 

Partial pivoting is a strategy of row interchanging to 
reduce rounding errors in the decomposition. The row in- 
terchanges are recorded in the otherwise underused XS 
format fields of the result matrix's diagonal elements. The 
recorded row interchanges identify the result matrix as 
containing an LU decomposition and the result matrix's 
descriptor includes two dashes when displayed. 

The determinant of the decomposed matrix A is just 
(-l) r times the product of the diagonal elements of U, 
where r is the number of row interchanges represented by P. 
The HP-15C computes the signed product after decompos- 
ing the argument matrix A into the result matrix. 

The HP-15C calculates the inverse of the decomposed 
matrix using the relationship 



[P- ! LU] 



U" l L _1 P 



It does this by inverting both U and L, computing the prod- 
uct of their inverses, and then interchanging the columns of 
the product in the reverse order of the row interchanges of 
A. This is all done within the result matrix, 

Solving a system AX=B for X is equivalent to solving 
LUX-PB for X, where PA=LU denotes the LU decomposi- 
tion of A. To solve this system f the HP-15C first decomposes 
the matrix A in place. The calculator then solves the matrix 
equation LY=PB for matrix Y [forward substitution) and 
Finally UX=Y for matrix X (backward substitution)* placing 
the solution X into the result matrix. 

The LU decomposition is returned by a determinant or 
system solution calculation and can be used instead of the 
original matrix as the input to subsequent determinant, 
matrix inverse, or system solution calculations. 

Norms and the Condition Number 

A norm of a matrix A, denoted by |j A || , is a matrix 
generalization of the absolute value of a real number or the 
magnitude of a complex number. Any norm satisfies the 
following properties: 

|| A [I ^0 for any matrix and || A || =0 if and only if 
A-0 

|| aA || = |a| x |j A || for any number a and matrix A 
■ || A + B || *s || A || + || B || for any matrices A and B 
i || A B || ^ || A || x || B |j for any matrices A and B, 

One measure of the distance between two matrices A and B 
is the norm of their difference. || A-B || . A norm can also be 
used to define a condition number of a square matrix, which 
measures the sensitivity of matrix calculations to perturba- 
tions in the elements of that matrix, 



The HP-15C provides three norms. The Frobenius norm 
of a matrix A t denoted || A || p H is the square root of the sum 
of the squares of the matrix elements. This is a matrix 
generalization of the Euclidean length of a vector. 

The HP-15C also provides the row (or row-sum) norm. 
The row norm of an m-by-n matrix A is the largest row sum 
of absolute values of its elements and is denoted by || A || R : 



A|| r = max £ |a iM 



V 

i = i 



The column (or column-sum] norm of a matrix A is denoted 
by II -A || c an cl is the largest column sum of absolute values 
of its elements. It can be computed as the row norm of the 
transpose of the matrix A, 

For any choice of norm, a condition number K(A) of a 
square matrix A can be defined by 

K{A] = || A || x || A" 1 || 

ThenK[A) = || A || x || A" 1 || M AA" 1 || = '111*1 
for any norm. The following discussion assumes the condi- 
tion number defined by the row norm. Similar statements 
can be made for the other norms. 

If rounding or other errors are present in matrix elements, 
these errors will propagate through subsequent matrix cal- 
culations. They can be magnified significantly. Consider, 
for example, the matrix product AB where A is a square 
matrix. Suppose that A is perturbed by the matrix AA. The 
relative size of this perturbation can be measured as 
|| AA || / 1| A || . The relative size of the resulting perturba- 
tion in the product is then 



AA B It / 1| A B 



AAA T A B || / 1| A B 

i ii 



^ || AA A 

m ||AA|| x || A 



i 



= Kf A) || AA || / 1| A || 

with equality for some choices of A 1 B, and AA. Hence K(A] 
measures how much the relative uncertainty of a matrix can 
be magnified when propagated into a matrix product. 

Uncertainties in the square system matrix A or the matrix 
B of the system of equations AX=B will also propagate into 
the solution X. For smal 1 relati ve uncertainti es A A i n A , say 
|| AA || / 1| A || <<1/K{A), the condition number is a close 
approximation to how much the relative uncertainty i n A or 
B can be magnified in the solution X. 6 

A matrix is said to be ill-conditioned if its condition 
number is very large. We have seen that errors in the data — 
sometimes very small relative errors — can cause the solu- 
tion of an ill-conditioned system to be quite different from 
the solution of the original system, In the same way. the 
inverse of a perturbed ill-conditioned matrix can be quite 
different from the inverse of the unperturbed matrix. But 
both differences are bounded by the condition number; they 
can be relatively large only if the condition number is large. 
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Singular and Nearly Singular Matrices 

arge condition number also indicates that a matrix is 
relatively close to a singular matrix (determinant — 0), 
Suppose that A is a nonsingular matrix. 

1K(A) = rain(||A-SM|A||) 

audi || A" 1 || - min ( || A-S I ). 

where each minimum is taken over all singular matrices S. 6 
1/ 1| A" 1 II is the distance from A to the nearest singular 
matrix, 1 K{A) Is this distance divided by the norm of A. 
For example, if 



then 



1 0.9999999999 J 



■[ 



-9,999,999,999 
10 10 



10 io 
-10 1D 



] 



and || A _1 || —2 x 10 10 . Therefore, there should exist a per- 
turbation matrix AA with || AA || = 5xlQ-n that makes 
A + AA singular. Indeed, 



AA 



[0 -5 xio-n-l 

5 x 10" 11 m 



has || AA || =5 xlO" 11 . and 



A + AA 



•[; 



0,99999999995 
0.99999999995 



] 



is singular. 

In principle, because the HP- 15Cs matrices are bounded 
in size, exact arithmetic and exact internal storage could be 
used to ensure 10-digit accuracy in matrix calculations. 
This was considered prohibitively expensive, however. In- 
stead, the KP-IdC is designed to perform arithmetic and 
slore intermediate calculated values using a fixed number 
of digits, 

Numerical determinant, matrix inversion, and system 
solution calculations using a fixed number of digils intro- 
duce rounding errors in their results. These rounding errors 
can be conceptually passed back to the input data and the 
calculated results interpreted as exact results for perturbed 
input data A + AA. If the norm of the conceptual perturba- 
tion AA is comparable to 1/ || A' 1 || , the original nonsingu- 
lar input matrix A may be numerically indistinguishable 
from a singular matrix. 



For example, a square matrix is singular if and only if at 
least one of the diagonal elements of U, the upper triangular 
matrix in the LU decomposition of A, is zero. But because 
the HP-15C performs calculations with only a finite number 
of digits, same singular and nearly singular matrices cannot 
be distinguished in this way. 

The matrix 



is singular, Using 10-digit accuracy, the calculated LU de- 
composition is 

L 0.3333333333 1 10"^J 



which is the decomposition of the nonsingular matrix 



-i 



0,9999999999 



] 



Hence the calculated determinants of B and D are identical. 
On the other hand, the matrix 



n 3 -I 

1 0.9999999999 J 

■ ' "Ip 3 1 

m 1/3 1 _ -10 -^J 



I I 



is nonsingular. Using 10-digit accuracy, the calculated LU 
decomposition is 

«,.[ ■ "if 3 3 i 

0,3333333333 1 _ _ 



333333 

u ■Inch is the LU decomposition of the singular matrix 

3 3 

L9999999999 0.9999999999 



C = 



[. 



] 



The calculated determinants of A and C are also identical, 
Because the calculated LU decompositions of some sin- 
gular and nonsingular matrices are identical, any test for 
singularity based upon a calculated decomposition would 
be unreliable- Some singular matrices would fail the test 
and some nonsingular ones would pass it. Therefore, no 
such test is built into the HP-15C. 

Instead, if a calculated diagonal ehfintjut n| U. which we 
call a pi vol. is found to be zero during the LU decomposi- 
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tion, rather than aborting the matrix calculation and re- 
porting the input matrix to be singular, the HP-15C replaces 
the zero pivot by a small positive number and continues 
with the calculation. This number is usually small com- 
pared to the rounding errors in the calculations. Specifi- 
cally ; it will be about 1 " l ° t imes the largest absol ute value 
of any element in that column of the original matrix, If 
every element in that column of the original matrix has an 
absolute value less than 10 ~ B9 , the value 10 ~ 39 is used 
instead. 

An advantage of replacing zero pivots by nonzero pivots 
is that matrix inversion and system solution calculations 
will not be interrupted by zero pivots. This is especially 
useful in applications such as calculating eigenvectors 
using the method of inverse iteration. Example programs 
calculating eigenvalues and eigenvectors can be found in 
reference 3. 

The effect of rounding errors and possible intentional 
perturbations causes the calculated decomposition to have 
all nonzero pivots and to correspond to a nonsingular ma- 
trix usually identical to or negligibly different from the 
original matrix. 

Complex Matrix Calculations 

The HP-15C only operates on real matrices, that is, ma- 
trices with real elements, However, it is possible to repre* 
sent complex matrices as real matrices and to perform ma- 
trix addition, subtraction, multiplication, and inversion of 
complex matrices and to solve complex systems of equa- 
tions using these real representations. 

Let Z = X + iY denote a complex matrix with real part X 
and imaginary part Y, both real matrices, One way to repre- 
sent Z as a real matrix is as the partitioned matrix 



■i'] 



having twice the number of rows but the same number of 
columns as Z, Complex matrices can be added or subtracted 
by adding and subtracting such real representations. 
Another computationally useful real representation for Z 



is 



[i -i] 



having twice the number of rows, and columns as Z, The 
HP-15C*s built-in matrix operation MATRIX 2 performs the 
transformation 

Z p -* Z 

The operation MATRIX 3 performs the inverse transforma- 
tion 



Z p 



Suppose A, B, and C are complex matrices and A 



invertible. Then complex matrix multiplication, inversion, 
and system solution can be performed with real matrices 
and built-in HP-15C operations using the relationships; 



P - P 
(AB) - AB i 



(A" 1 ) - (A.) , 



A C = B -> C p = (A] B*\ 



These procedures are illustrated in the HP-15C Owner's 
Handbook, 

Matrix Transpose 

The operations MATRIX 2 and MATRIX 3 perform their 
transformations using a matrix transpose routine. The rows 
and columns of a matrix are interchanged to form the trans- 
pose of that matrix. The transformation is performed in 
place, replacing the original matrix by its transpose. This 
routine is available to the user as MATRIX 4. Consider the 
following example: 



a 


b 


c 




a 


d 


d 


e 


f 


** 


b 
c 


e 
f 



Here the elements of the matrices have been displayed in 
a two-dimensional format, However, they are stored in a 
one-dimensional sequence within the calculator's memory. 
For this example, the transpose operation changes the or- 
dering of the elements within the calculator memory as 



t 



d b 



r 



The MATRIX 4 operation moves the elements according to 



o 




o 



These mo%'ements form disjoint loops. The first value in 
the sequence is the first candidate for moving. As a value is 
copied into its destination, that destination is tagged in its 
XS field. The previous value at that location is the next 
candidate for moving. Movement along a loop continues 
until a destination is encountered that is already tagged, 
The content of the tagged destination is not changed and 
the current loop is terminated. The value in the location 
immediately following that tagged destination is the next 
candidate for moving. 

This operation continues moving values along loops 
until the sequence is exhausted, at which point all destina- 
tion tags are removed. Finally, the recorded dimensions of 
the matrix are switched. 
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Accuracy of Matrix Calculations 

Accuracy specifications for all matrix operations are 
given in reference 3, These specifications are stated in terms 
of both backward and forward error analysis. Reference 3 
includes a general rule of thumb for the number of signifi- 
cant digits in a calculated matrix inverse or system solution, 
ft also includes descriptions of techniques to improve upon 
the accuracy of calculated system solutions and to reduce 
the ill-conditioning of systems of equations. 
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A Pocket Calculator for Computer 
Science Professionals 

This compact, yet powerful pocket calculator is designed 
for technical professionals working in computer science 
and digital electronics, Boolean operations and bit 
manipulation are some of its capabilities. 

by Eric A. Evett 



LOGIC DESIGN and computer programming require 
mathematical operations not ordinarily provided by 
small calculators. A large amount of tedious paper- 
work is often required to convert among number bases, 
perform logic operations, shift and rotate bits in a word, or 
check processor instruction flow. To simplify such work, 
Hewl ett-Packard recently introduced a programmable pock- 
et calculator especially designed for people who deal with 
bits. The HP-16C (Fig. 1], like other HF calculators, uses a 
reverse-Polish- notation [RPN) system and provides stan- 
dard floating-point decimal arithmetic [including square 
root). Its novel capabilities become apparent; however, 
when the HP-16C is switched into the integer mode, Only 
integers are allowed in this mode, and they can be keyed in 
and displayed in either hexadecimal, octal, binary, or dec- 
imal format. In this mode, number base conversion, integer 
arithmetic, logical operations, and bit manipulations can be 
done. 



Integer Mode 

In the integer mode, all numbers are represented inter- 
nally in binary form. The word size is selected by the user 
and can range from 1 to 64 bits. The user also can select 
whether the numbers are to be interpreted as one's com- 
plement, two's complement, or unsigned integers, In the 
unsigned integer mode with a 64- bit word size, numbers up 
to 2 64 " 1 [18,446,744,073,709,551,615) can be represented, 
Although the HP-16C normally displays the eight least- 
significant digits of a number, a scrolling capability is pro- 
vided to display higher-order digits. 

Programming 

In addition to the four- register RPN stack, 203 bytes of 
user memory are available for storing program steps and use 
as storage registers. When the program memory is cleared, 
all 203 bytes are allocated to storage registers. The number 
of storage registers available depends on the selected word 




Fig. 1. The HP-T6C Programma- 
ble Calculator is designed for 
computer science and digital elec- 
tronics applications. Besides the 
norma! four-function calculator 
features, it has a number of 
capabilities for setting number 
bases and word sizes, performing 
Boolean operations, and ma- 
nipulating bits. 
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Real (Floating-Point) Format 



Real numbers are represented in the HP 3000 memory by 32 
iS^bit words) separated into three 
These fteids are the sign, the exponent and r 
format is known as excess 256 Thus, a real number cor ? 
1): 
(3), bit Oof me first 1 6-bit word Positive=0, negate = 
value X and its negative —X differ onJy in the value of the sign 

■ Expr -- e bits 1 through 9 of the first l&bit word. The 

exponent ranges from to 777 octal (511 decimal) Thus 

number represents a binary exponent biased by 400 octal {256 

nrtai ) The true exponent, therefore is E -256 it ranges from 

-256 to +255, 

» Fraction IF), a binary number of the form 1 xxx, where xxx 
represents 22 bits stored in bits 1 through 1 5 of the first 16-bit 
word and all bits of the second 16-bit word. Note that the 1 
is not actually stored, there is an assumed 1 to the left of the 



binary point Floating- point zero is the only exception ft is 
represented by all 32 bits being zero 

mge ot nonzero real values for this format is from 0.86361 7 
xlQ-^toG 115792C e dec- 

imal value of a floating-point representation Is: Decimal value = 
-* ■ :_■ -■ cF 

Sign (Bit 0) 



tiponent 

I Bits 
(Bits 1 to 9) 10 to 15} 



iBrtsOto 15) 



First 1&-ert Word 



Second 16- Bit Word 



Fig. 1. Dmgram of real (floating-point) format used in the HP 
3Q0G. 



Set twos complement mode. 

Convert to octal integer mode, 
and return Integers y and x 
such that 2 J y = original input. 

Leading zeros will be displayed 

Was Input 0? 

If yes, then branch to Label 1. 



Bias exponent. 287-256+31 

Store biased exponent in index register. 

Swap exponent and mantissa. 

Set flag 0. 

Mantissa negative? 

If yes, clear flag 0. 

Absolute value of mantissa. 



Create mask of 23 bits, left-justified 

Extract upper 23 btta* 

Did round cause a carry-out 
Of most significant bit? 

If yes, Increment exponent. 

Shift off implied 1 bit. 

Recall biased exponent. 

Concatenate exponent to fraction part. 

Is mantissa sign to be positive? 

It yes, branch to Label 1 . 



Set the sign bit. 



Rotate sign, exponent, and fraction 
to proper position, 



Set word size to 32. 



Round mantissa to 23 bits. 



Store the 32 -bit resull in register 0. 



Change word size to 1 & bfts, 
i Recall 16-blt word 1. 
Recall 16-bit word 2. 



Fig. 2. Outline of HP-16C sub- 
routine to convert numbers given 
in the HP 300Q Computer's rear 
format to decimal floating-point 
format. 
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Using the HP-16C 



Listing the features of a programmable calculator rarely pro- 
vides a complete picture of its capabilities. Examples of the appli- 
cation of the calculators features are often required to de- 
monstrate to the user what can be done and why a particular 
feature is useful. Hence, several examples of the use of the 
HP-16C are given below. 

Add with Carry 

The HP-16C can be programmed to simulate instructions 
commonly found in commercial processors The following sub- 
routine performs an add with carry (Y+X+C-*X). H adds the 
numbers in registers X and Y along with the carry bit indicated by 
the state of ffag 4 and returns the result in the X register The carry 
flag is set (indicated by the C annunciator in the display) if there is 
a carry-out of the most significant bit of the result, 



001 
002 
003 
004 

005 
006 
007 
008 
009 
010 
011 



LBL A 



RLC 

+ 

CFO 
F?4 
SFO 

+ 
F7 
SF4 
RTN 



Labels subroutine 

Generates or 1 depending on carry f 
Adds carry to second operand 

Copies carry flag 4 to flag 

Adds first operand to the toial 

Sets carry flag if first add carried 



To use this routine, enter the two operands in registers Y and X. 
and press GSB A 



Example: 
2S 
HEX 



Set two's complement mode 
Set number base to hexadecimal 



WSIZE Set word size to 8 

CF 4 Clear carry flag 

FE First operand 

ENTER Enter first operand into Y register 

72 Second operand 

GSB A Displays 70 (FE + 72+0) 

with carry set (C annunciator on) 

Brt Extraction 

The following subroutine extracts a field from a bit pattern. The 
field is specified by the bit numbers of the pattern corresponding 
to the lowest-order and highest-order bits of the field. The least- 
significant bit of the bit pattern is bit numbarO. Hence, the result in 
the X register is the bits of the pattern in the Z register from the bit 
number in the Y register to the bit number in the X register, 
inclusive, 



Labels subroutine 

Bring down value in Y register 

Right-justifies field 

Raise stack 

Recall Y value 

Subtract Y from X 



001 


LBL B 


002 


FU 


003 


RRn 


004 


R] 


005 


LSTx 


006 


- 


O07 


1 


008 


+ 


009 


MASKR 


010 


AND 


011 


HEX 


012 


RTN 



Computes number of bits m field 
Creates mask same width as Held 
Extracts field 
Exits in the hexadecimal mode 



size. When the word size is eight bits, 203 registers are 
available; a 16-bit word size results in 101 available regis- 
ters, and so on. Each programmable instruction takes one 
byte of memory. As program steps are Inserted, the number 
of available storage resist ers ilei., teases. A program can have 
up lo 203 steps if no storage registers are required. 

Editing capabilities to make program development easier 
include insert, delete, back-step, single-step, and go-to- 
line-number operations. The user may single- step through 
program execution to help debug programs. Other pro- 
gramming features include label addressing (sixteen 
labels), subroutines (up to four levels deep], conditional 
tests, branching, and six user flags. 

These Hags can be set, cleared, and tested under program 
control. Three of the flags are special. Leading zcto digits in 
a word are suppressed in the display unless flag 3 is set. Flag 
4 is the carry flag, and flag 5 is the overflow flag. Two 
annunciators in the display [C for carry and G for > largest 
representable number) give a visual indication of the 
state of flags 4 and 5, respectively. The overflow flag is 
set if the true result of an operation cannot be represented 
in the selected word size and complement mode. The 
carry flag is set under various conditions, depending on 
the operation, For example, addition sets the carry flag 
if there is a carry-out of the most significant bit; other- 
"hi- carry is cleared (see box above for examples). 
The shift-left instruction sets the carry if a 1 bit is shifted 



off the left end of the word; otherwise the carry is cleared. 

Logic Operations 

The rich selection of bit manipulation and logical opera- 
tions, along with user- selectable complement mode and 
word size f make the HP-16C a flexible logic and program 
design tooL Programs can be written to simulate individual 
instructions commonly found on commercial processors, to 
extract a field from a bit pattern, or to convert from one 
numeric format to another, 

A common problem is the conversion between the inter- 
nal binary floating-point format of a particular machine and 
decimal floating-point format. The HP-16C provides a fea- 
ture that can be used to great advantage in programs de- 
signed to perform such conversions. This feature provides a 
mode for performing standard decimal floating-point cal- 
culations. Upon switching from the Integer mode to deci- 
mal floating-point mode (by using the FLOAT function), the 
integers y and x in the V and X stack registers are converted 
to the floating-point equivalent of 2*y, which is then placed 
in the X register and displayed. Converting back to integer 
mode (by pressing the HEX, DEC. OCT. or BIN keys], causes 
the contents of the X register to be converted to a pair of 
integers y and x such that y is a 32-bit integer (2 31 ^s | y | <2 32 
unless y=0) and 2*y is equal to the value in the X register 
before mode conversion. The integers y and x are then 
placed in the Y and X registers. 
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To use this routine, tm user places the bit pattern in register Z f the 
vest-orde + ie!d tfi register Y. and the 

number of the highest-orde < The user 

then presses ©SB B 

I -tract bits 2 through 5 tYom 39, 6 (001 111301 } 
s 

.vordstze to 9 
Set hexadecimal mode 
Sit pattern 



WSiZE 
HEX 

39 

ENTE* 

2 

ENTER 

5 

GSB B 



Lowest-order oil 

est-order bit 
Displays E (TITO) as result. 



Conversion Between Binary and Gray Code 

Gray code has the property that only one bit changes between 
the representations of any two adjacent numbers If the word size 
is n bits, then binary-to- Gray-code conversion is given by 

G a ^ B XORB, 

G, = B.XOR B 2 



G n - 2 = B n _ 2 xoHB„_, 

where G is the Gray code number, B is the binary number, and 
subscript indicates the [east-significant btt of G and B. subscript 
1 indicates the next least-significant bit, and so forth 



The Gray-oode-to-binary converston is given by 
B g = G^ xofl G* xqr -. - - G 

- G. xcrG 2 xor * * * G 

Binary- lo- Gray-code subroutine: 

001 LBL C 

002 ENTER 

003 SR 



004 

005 



XOR 
RTN 



Copses binary number to Y register 
Shifts binary number in X register to 
the nght 

Computes Gray-code equt. 



Gray-to- binary-code subroutine: 



001 
002 
003 
004 

005 
006 

007 
008 
009 
010 



LBL D 
ENTER 
LBL 2 
SR 

XOR 

LSTx 
X^0 
GT0 2 
Rl 
RTN 



Copies Gray code number to Y register 

Shift Gray code number m X register to 

the right 
Exclusive OR operation 
Recall previous number 
Loop until Gray code number is 



To use these routines, the user sets The HP-TSC to the binary 
mode by pressing BIN, places the number in the X register and 
presses GSB c for binary- to-Gray or GSB D for Gray-to- binary- 
code conversions 



Set hexadecimal base mode. 
Set twos complement mode. 



Extract biased exponent part. 
Recover traction part. 



Set word size to 32 bits. 
Swap word 1 and word 2. 



Shift word 1 16 bits to left 
Concatenate word 2 to word 1. 
Shift sign bit left inlo carry flag. 



Set bit 23 (the Implied 1-blt). Bit 
is the toast significant. 



Test carry flag. Was sign bit negative? 
If yes, complement mantissa. 
Arithmetic shift right mantissa t btt. 
Swap mantissa and exponent part 



Rotate exponent part 9 bits left, 
placing It at right end. 



Is Input 07 

If yes, branch to Label 0- 



Create mask of 23 bits, right-Justified. 
Extract fraction pari 



Unblas exponent. E 278 =e -256 -32 



Compute 2*y 



Fig. 3, Outline of HP-16C sub- 
routine to convert decimal 

floating-point numbers into the 
format used by HP 3000 Comput- 
ers. 
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The subroutines listed in Fig. 2 and Fig. 3 convert be- 
tween the HP SOflO Computers FORTRAN real [floating- 
point) format 2 and decimal floating-point format (see box 
on page 37)- Because the HF-lftC views bit as the least 
significant bit of a word and the HP 3000 views it as the 
most significant bit. some of the steps listed in Fig. 2 and 
Fig. 3 are used to convert between these two opposing 
views. 

To use these programs after they are entered in the HP- 
16C T a user performs the following steps, 
a HP 3000 to decimal: 

i. Select octal base (OCT), 

2, Enter word 1 in the Y register and word 2 in the X 
register, 

3, Execute GSB B. Answer is displayed- 

4, Repeat steps 1,2, and 3 for each new conversion. 
i Decimal to HP 3000: 

1, Select decimal floating-point mode (FLOAT 4). 

2. Enter number in the X register. 

3, Execute GSB A. Word 1 is placed in the Y register, 
word 2 in the X register. 

4. Repeal steps 1, 2, and 3 for each new conversion. 
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CORRECTION 

in [he March issue, the Pascal statements at the top of the back page were printed in ine 
wrung order Here >s the correct version 

buff er [ w T ] : - getch ; 
c: =c -Ha [buffer [w]J: 
w;=w+l; 
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